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DESCRIPTION 

Method for Diagnosing Chronic Myeloid Leukemia 

Field OF THE Invention 

The invention relates to methods of diagnosing chronic myeloid leukemia. 

Priority Information 

This application claims priority to United States Provisional Application Serial 
No.60/414,867, filed September 30, 2002. 



10 Background of the Invention 

Chronic myeloid leukemia (CML) is a clonal myeloproliferative disease characterized by 
Philadelphia (Ph) chromosome translocation (1). The resulting BCR-ABL fusion gene encodes a 
cytoplasmic protem that is constitutively activated for its tyrosine kinase activity. CML 
progresses through distinct clinical stages; the earliest stage, termed the chronic phase, is 

15 characterized by expansion of terminally differentiated neutrophils. The acute phase termed 
accelerated phase and blast crisis characterized by maturation arrest with excessive numbers of 
undififerentiated myeloid or lymphoid progenitor cells (2). Current therapies include allogenic 
stem-cell transplantation (SCT) and chemotherapies mcluding interferon-a (IFN-a (3). IFN-a 
prolongs overall survival but has considerable adverse effects. SCT is the only cxirative 

20 treatment, but is associated with substantial morbidity and is limited to patients with suitable 
donors. 

The development of the ABL-selective tyrosine kinase inhibitor STI57 1 
(imanitib; Glivec; Novartis, Basel, Switzerland) is a significant advance in the management of 
CML (4,5). STI571 frequently induces remarkable hematologic and cytogenetic responses m 
25 these clinical settings. However,, recent clinical studies with STI571 in CML demonstrated that 
many patients at the advanced stage req)ond well but then relapse (6),(7). Resistance to STI571 
because of enhanced expression or mutation of BCR-ABL gene has been found in CML patients 
(8), (9). Indeed, STI571-induced hematolo^c responses occur less frequently and are less 
durable in CML patients at the blast crisis phase. 
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cDNA microarray technologies have enabled to obtain comprehensive profiles of gene 
expression in normal and malignant cells, and compare the gene expression in malignant and 
corresponding normal cells (Okabe et al.. Cancer Res 61:2129-37 (2001); Kitahara et al.. Cancer 
Res 61: 3544-9 (2001); Lin et al,. Oncogene 21:4120-8 (2002); Hasegawa et al.. Cancer Res 

5 62:7012-7 (2002)). This approach enables to disclose the complex nature of cancer cells, and 
helps to understand the mechanism of carcinogenesis. Identification of genes that are 
deregulated in tumors can lead to more precise and accurate diagnosis of individual cancers, and 
to develop novel therapeutic targets (Bienz and Clevers, Cell 103:31 1-20 (2000)). To disclose 
mechanisms underlying tumors from a genome-wide point of view, and discover target 

10 molecules for diagnosis and development of novel therapeutic drugs, the present mventors have 
been analyzing the expression profiles of tumor cells using a cDNA microarray of 23040 genes 
(Okabe et al.. Cancer Res 61:2129-37 (2001); Kitahara et al.. Cancer Res 61 :3544-9 (2001); Lin 
et al.. Oncogene 21:4120-8 (2002); Hasegawa et al.. Cancer Res 62:7012-7 (2002)). 

Studies designed to reveal mechanisms of carcinogenesis have already facilitated 

15 identification ofmolecular targets for anti-tumor agents. For example, inhibitors of 

femexyltransferase (FTIs) which were originally developed to mhibit the growth-signaling 
pathway related to Ras, vAiOs& activation depends on posttranslational famesylation, has been 
effective in treating Ras-dependent tumors in animal models OHe et al.. Cell 99:335-45 (1999)). 
Clinical trials on human using a combination or anti-cancer drugs and anti-HER2 monoclonal 

20 antibody, trastuzumab, have been conducted to antagonize the proto-oncogene receptor 

HER2/neu; and have been achieving improved clinical response and overall survival of breast- 
cancer patients (Lin et al.. Cancer Res 61 :6345-9 (2001)). A tyrosme kinase inhibitor, STI-571, 
which selectively inactivates bcr-abl fusion proteins, has been developed to treat chronic 
myelogenous leukemias wherein constitutive activation of bcr-abl tyrosine kinase plays a crucial 

25 role m the transformation of leukocytes. Agents of these kinds are designed to suppress 
oncogenic activity of specific gene products (Fujita et al.. Cancer Res 61:7722-6 (2001)). 
Therefore, gene products commonly up-regulated in cancerous cells may serve as potential 
targets for developing novel anti-cancer agents. 

It has been demonstrated that CD8+ cytotoxic T lymphocytes (CTLs) recognize epitope 

30 peptides derived from tumor-associated antigens (TAAs) presented on MHC Class I molecule, 
and lyse tumor cells. Since the discovery of MAGE family as the first example of TAAs, many 
other TAAs have been discovered using immunological approaches (Boon, Int J Cancer 54: 177- 
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80 (1993); Boon and van der Bruggen, J Exp Med 183: 725-9 (1996); van der Braggen et al.. 
Science 254: 1643-7 (1991); Brichard et al., J Exp Med 178: 489-95 (1993); Kawakami et al., J 
Exp Med 180: 347-52 (1994)). Some of the discovered TAAs are now in the stage of clinical 
development as tar^ts of immunotherapy. TAAs discovered so far include MAGE (van der 

5 Bruggen et al.. Science 254: 1643-7 (1991)), gplOO (BCawakami et al., J Exp Med 180: 347-52 
(1994)), SART (Shichijo et al., J Esq? Med 187: 277-88 (1998)), andNY-ESO-1 (Chen et al., 
Proc Natl Acad Sci USA 94: 1914-8 (1997)). On the other hand, gaie products which had been 
demonstrated to be specifically overexpressed in tumor cells, have been shown to be recognized 
as targets inducing cellular immune responses. Such gene products include p53 (Umano et al., 

10 Brit J Cancer 84: 1052-7 (2001)), HER2/neu (Tanaka et al., Brit J Cancer 84: 94-9 (2001)), CEA 
(Nukaya et al., Int J Cancer 80: 92-7 (1 999)), and so on. 

In spite of significant progress in basic and clinical research concerning TAAs (Rosenbeg 
et al.. Nature Med 4: 321-7 (1998); Mukherji et al., Proc Natl Acad Sci USA 92: 8078-82 (1995); 
Hu et al.. Cancer Res 56: 2479-83 (1996)), only limited number of candidate TAAs for the 

15 treatment of adenocarcinomas, including colorectal cancer, are amiable. TAAs abundantly 
expressed in cancer cells, and at the same time ^ch egression is restricted to cancer cells 
would be promising candidates as immunotherq>eutic targets. Further; identification of new 
TAAs inducing potent and specific antitumor immune responses is expected to aicourage 
clinical use of peptide vaccination strategy in various types of cancer (Boon and can der Bruggen, 

20 J Exp Med 183: 725-9 (1996); van der Bruggen et al.. Science 254: 1643-7 (1991); Brichard et 
al., J Exp Med 178: 489-95 (1993); Kawakami et al. J Exp Med 180: 347-52 (1994); Shichijo et 
al., J Exp Med 187: 277-88 (1998); Chen et al., Proc Nad Acad Sci USA 94: 1914-8 (1997); 
Harris, J Natl Cancer Inst 88: 1442-5 (1996); Butteifield et al.. Cancer Res 59: 3134-42 (1999); 
Vissers et al.. Cancer Res 59: 5554-9 (1999); van der Burg et al., J Immunol 156: 3308-14 

25 (1996); Tanaka et al.. Cancer Res 57: 4465-8 (1997); Fujie et al., Int J Cancer 80: 1 69-72 (1999); 
Kikuchi et al., Iht J Cancer 81: 459-66 (1999); Oiso et al., Int J Cancer 81 : 387-94 (1999)). 

It has been repeatedly reported that peptide-stimulated peripheral blood mononuclear 
cells (PBMCs) from certain healthy donors produce significant levels of IFN-^ in response to the 
peptide, but rarely exert cytotoxicity against tumor cells m an HLA-A24 or -A0201 restricted 

30 manner in 51Cr-release assays (Kawano et al., Cance Res 60: 3550-8 (2000); Nishizaka et al.. 
Cancer Res 60: 4830-7 (2000); Tamura et al., Jpn J Cancer Res 92: 762-7 (2001)). However, 
both of HLA-A24 and HLA-A0201 are one of the popular HLA alleles in Japanese, as well as 
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Caucasian (Date et al.. Tissue Antigens 47: 93-101 (1996); Kondo et al., J Immnnol 155: 4307- 
12 (1995); Kubo et al., J Immunol 152: 3913-24 (1994); Imanishi et al.. Proceeding of the 
eleventh International Hictocompatibility Workshop and Conference Oxford University Press, 
Oxford, 1065 (1992); Williams et aL, Tissue Antigen 49: 129 (1997)). Thus, antigenic peptides 
5 of carcinomas presented by these HLAs may be especially usefiil for the treatment of carcinomas 
among Japanese and Caucasian. Further, it is Imown that the induction of low-afBnity CTL in 
vitro usually results j&om the use of peptide at a high concentration, generatii^ a hi^ level of 
specific peptide/MHC complexes on antigen presenting cells (APCs), vAada. will effectively 
activate these CTL (Alexander-Miller et al., Proc Natl Acad Sci USA 93: 4102-7 (1996)). 

10 STJMMARY OF THE INVENTION 

The invention is based upon the discovery of a pattern of gene expression correlated witii 
CML. The genes that are differentially expressed in CML are collectively referred to herein as 
"CML nucleic acids" or "CML polynucleotides" and the corresponding encoded polypeptides are 
referred to as "CML polypeptides" or "CML proteins." 

15 Accordingly, the invention features a method of diagnosing or determining a 

predisposition to CML in a subject by detennining an expression level of a CML-associated gene 
in a patient derived biological san^le, such as peripheral blood sample or myeloid cells sample. 
By CML associated gene is meant a gene that is characterized by an e:q>ression level vwhich 
differs in a cell containing a Philadelphia (Ph) chromosome translocation or in a cell obtained 

20 from an individual with a femily history of CML or an individual exhibiting clinical symptoms 
of CML, compared to a normal peripheral blood cell. A CML-associated gene is one or more of 
CML 1-296. An alteration, e.g., increase or decrease of the level of expression of 4e gene 
compared to a normal control level of the gene mdicates that the subject suffers from or is at risk 
of developing CML. 

25 By normal control level is meant a level of gene expression detected in a normal, healthy 

individual or in a population of individuals known not to be suffering from CML. A control 
level is a single e3q)ression pattern derived from a single referoice population or from a plurality 
of expression patterns. For example, the control level can be a database of expression patterns 
from previously tested cells. A normal individual is one with no clinical symptoms of CML and 

30 no family history of CML. 

An increase in the level of CML 1-190 detected m a test sample compared to a normal 
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control level indicates the subject (from which the sample was obtained) suflFers from or is at risk 
of developmg CML. In contrast, a decrease in the level of CML 191-296 detected in a test 
sample compared to a normal control level indicates said subject suffers from or is at risk of 
developing CML. 

5 Alternatively, expression of a panel of CML-associated genes in the sample is compared 

to a CML control level of the same panel of genes. By CML control level is meant the 
expression profile of the CML-associated genes found in a population suffering from CML. 

Gene expression is increased or decreased 10%, 25%, 50% compared to the control level. 
Altemately, gene expression is increased or decreased 1 , 2, 5 or more fold compared to the 
10 control level. Expression is determmed by detecting hybridization, e.g. , on an array, of a CML- 
associated gene probe to a gene transcript of the patient-derived cell sample. 

The patient-derived cell sample is any cell from a test subject, e.g., a patient known to or 
suspected of having CML. For example, the sample contains a mixture of mononuclear cells 
from peripheral blood. 

15 The invention also provides a CML reference expression profile of a gene expression 

level of two or more of CML 1-296. Alternatively, the invention provides a CML reference 
expression profile of the levels of expression of two or more of CML 1-190 or CML 191-296. 

The invention fiirther provides methods of identifying an agent that inhibits or enhances 
the expression or activity of a CML-associated gene, e.g. CML 1-296 by contacting a test cell 

20 expressing a CML associated gene with a test agent and determining the expression level of the 
CML associated gene. The test cell is a mononuclear cell such as a mononuclear cell from 
peripheral blood of CML patient. A decrease of the level compared to a normal control level of 
the gene mdicates that the test agent is an inhibitor of the CML-associated gene and reduces a 
symptom of CML, e.g, CML 1-190. Alternatively, an increase of the level or activity compared 

25 to a normal control level or activity of the gene indicates that said test agent is an enhancer of 
e3q)ression or fimction of the CML-associated gene and reduces a symptom of CML, e.g.y 
CML 191-296. 

The invention also provides a kit with a detection reagent which binds to two or more 
CML nucleic acid sequences or which binds to a gene product encoded by the nucleic acid 
30 sequences. Also provided is an array of nucleic acids that binds to two or more CML nucleic 
acids. 
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Therapeutic methods include a method of treating or preventing CML in a subject by 
administering to the subject an antisense composition. The antisense composition reduces the 
expression of a specific target gene, e.g-, the antisense composition contains a nucleotide, which 
is complementary to a sequence selected jfrom the group consisting of CML 1-190. Another 
5 method includes the steps of administering to a subject an short interfering RNA (siRNA) 

composition. The siRNA composition reduces the expression of a nucleic acid selected from the 
group consisting of CML 1-190. In yet another method, treatment or prevention of CML in a 
subject is carried out by administering to a subject a ribozyme composition. The nucleic acid- 
specijSc ribozyme composition reduces the expression of a nucleic acid selected from the group 

10 consisting of CML 1-190. Other therapeutic methods include those in which a subject is 
administered a compound that increases the expression of CML 191-296 or activity of a 
polypeptide encoded by CML 191-296. Furthermore, CML can be treated by administering a 
protein encoded by CML 191-296. The protein may be directly administered to the patient or, 
alternatively, may be expressed in vivo subsequent to being introduced into the patient, for 

15 example, by administering an expression vector or host cell carrying the down-regulated marker 
gene of interest. Suitable mechanisms for in yivo expression of a gene of interest are known in 
the art 

The invention also includes vaccines and vaccination methods. For example, a 
method of treating or preventing CML in a subject is carried out by administering to the subject a 

20 vaccine containing a polypeptide encoded by a nucleic acid selected from the group consisting of 
CML 1-190 or an immunologically active fragment such a polypeptide. An immunologically 
active fragment is a polypeptide that is shorter in lengtii than the full-length naturally-occurring 
protein and which induces an immune response. For example, an immunologically active 
fragment at least 8 residues in lengfli and stimulates an immune cell such as a T cell or a B cell. 

25 Immune cell stimulation is measured by detecting cell proliferation, elaboration of cytokines 
(e.g., IL-2), or production of an antibody. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Althou^ niethods and materials similar or equivalent to those described herem can be 

30 used in the practice or testing of the present invention, suitable methods and materials are 

described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In case of conflict, the present 



wo 2004/031409 



PCT/JP2003/010256 



-7- 

specification, including definitions, will control. In addition, the materials, methods, and 
examples are illustrative only and not intended to be lindting. 

One advantage of the methods described herem is that the disease is identified prior to 
detection of overt clinical symptoms such as expansion of terminally-differentiated neutrophils. 
5 Other features and advantages of the mvention will be apparent firom the following detailed 
description, and fiom the claims. 

BiuEF Description OF THE Figures 

Figure 1 is a photogrq)h of a DNA agarose gel showing expression of representative 
1 1 genes and p-actin examined by semi-quantitative RT-PCR using cDNA prepared fit>m 
10 amplified RNA. The first lane shows the expression level of each gene in a normal individual. 
The remaining eight lanes each show the expression level of the genes in a different CML 
patient Gene symbols are noted for genes v^ose fimction was known or inferred and 
Accession No. Noted for ESTs. 

Detailed Description 

The present invention is based in part on the discovery of changes in expression patterns 
of multiple nucleic acid sequ^ces in mononuclear cells firom peripheral blood of patients with 
Chronic myeloid leukemia (CML). The differences in gene expression were identified by using a 
comprehensive cDNA microarray system. 

Using a cDNA microarray containing 23,040 genes, comprehensive gene-e3q>ression 
profiles were obtained of 27 CMLs. Two hundred ninety-six genes were found to be 
differentially expressed in mononuclear cells firom peripheral blood. Results show that certain 
genes are expressed at low or high levels in CMLs. Selection was made of candidate 
molecular markers with the potential of detecting CML-related proteins in blood, serum, or 
sputum of patients, and discovered some potential targets for development of signal- 
suppressing strategies in CML. 

The differentially expressed genes identified herein are used for diagnostic purposes as 
markers of CML as gene targets, the expression of which is altered to treat or alleviate a 
symptom of CML. 

The genes whose expression levels are modulated (/.^., increased or decreased) in CML 



20 
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patients are summarized in Tables 3-4 and are collectively referred to herein as "CML-associated 
genes"> "CML nucleic acids" or "CML polynucleotides" and the corresponding encoded 
polypeptides are referred to as "CML polypeptides", or "CML proteins." Unless indicated 
otherwise, "CML" is meant to refer to any of the sequences disclosed herein. (e,g„ CML 1-296). 
S The genes that have been previously described are presented along with a database accession 
number* 

By measuring expression of the various genes in a sample of cells, CML is diagnosed. 
Similarly, by measuring the expression of these genes in response to various agents can identify 
agents for treating CML. 

10 The invention involves determining (e.g., measuring) the expression of at least one, and 

up to all the CML sequences listed in Tables 3-4. Using sequence information provided by the 
GeneBank database entries for the known sequences the CML associated genes are detected 
and measured using techniques well known to one of ordinary skill in the art For example, 
sequences within the sequence database entries corresponding to CML sequences, are used to 

15 construct probes for detecting CML RNA sequences in, e.g., northern blot hybridization analyses. 
Probes include at least 10, 20, 50, 100, 200 nucleotides of a reference sequence. As another 
example, the sequences can be used to construct primers for specifically amplifying tiie CML 
sequences in, e.gy amplification-based detection methods such as revCTse-transcription based 
polymerase chain reactiotL 

20 Expression level of one or more of the CML sequences in the test cell population, e.g., a 

patient derived cell sample is then compared to expression levels of the same sequences in a 
reference population. The reference cell population includes one or more cells for which the 
compared parameter is known, Le., CML cells or non-CML cells. 

Whether or not a pattern of gene expression in the test cell population compared to the 

25 reference cell population indicates CML or a predisposition thereto depends upon on the 
composition of the reference cell population. For example, if the reference cell population is 
composed of non-CML cells, a similar gene e^qpression pattem in the test cell population and 
reference cell population indicates tiie test cell population is non-CML. Conversely, if the 
reference cell population is made up of CML cells, a similar gene expression profile between the 

30 test cell population and the reference cell population indicates that the test cell population 
includes CML cells. 
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A level of expression of a CML marker gene in a test cell population is considered 
altered in levels of expression if its expression level varies from the reference cell population by 
more than LO, 1.5, 2.0, 5.0, 10.0 or more fold from the expression level of the corresponding 
CML sequence in the reference cell population. 

5 Differential gene expresdon between a test cell population and a reference ceil 

population is normalized to a control nucleic acid, e.g. a hoxisekeeping gene. For example, a 
control nucleic acid is one v^ch is known not to differ depending on the CML or non-CML 
state of the cell. Expression levels of the control nucleic acid in the test and reference nucleic 
acid can be xised to normalize signal levels in the compared populations. Control genes include 

10 p-actin, glyceraldehyde 3- phosphate dehydrogenase or ribosomal protein PI . 

The test cell population is compared to multiple reference cell populations. Each of the 
multiple reference populations may differ in the known parameter. Thus, a test cell population 
may be compared to a second reference cell population known to contain, e.g.y CML cells, as 
well as a second reference population known to contain, e.g., non-CML cells (normal cells). The 

15 test cell is included in a tissue type or cell sample from a subject known to contain, or to be 
suspected of containing, CML cells. 

The test cell is obtained from a bodily tissue or a bodily fluid, e.g., biological fluid (such 
as blood or sputum). For example, the test cell is purified from a tissue. Preferably, the test cell 
population comprises a mononuclear cell. 

20 Cells in the reference cell population are derived from a tissue type similar to test cell. 

Optionally, the reference cell population is a cell line, e.g a CML cell line (positive control) or a 
normal non-CML cell line (negative control). Alternatively, the control cell population is 
derived from a database of molecular information derived from cells for which the assayed 
parameter or condition is known. 

25 The subject is preferably a mammal. The mammal can be, e.g , a human, non-human 

primate, mouse, rat, dog, cat, horse, or cow. 

Expression of the genes disclosed herem is determined at the RNA level using any 
method known in the art. For example, Northem hybridization analysis using probes which 
specifically recognize one or more of these sequences can be used to determine gene expression. 

30 Alternatively, expression is measured using reverse-transcription-based PCR assays, e.g, using 
primers specific for the differentially expressed sequences. Expression is also determined at the 
protein level, te., by measuring the levels of polypeptides encoded by the gene products 
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described herein, or biological activity thereof. Such methods are well known in the art and 
include, e.g., immunoassays based on antibodies to proteins encoded by the genes. The 
biological activity of the proteins encoded by the genes are also well known. 
Diagnosing CML 

5 CML is diagnosed by measuring the level of expression of one or more CML nucleic acid 

sequences from a test population ofcells,(i.e., a patient derived biological sample). Preferably, 
the test cell population contains a mononuclear cell, e.g., a cell obtained from peripheral blood. 
Other biological samples can be used for measuring the protem level. For example, the protein 
level in the blood, or serum, derived from subject to be diagnosed can be measured by 

10 immxmoassay or biological assay. 

Expression of one or more of CML-associated genes, e.g., CML 1-296 is determined in 
the test cell or biological sample and compared to the expression of the normal control level. A 
normal control level is an expression profile of a CML-associated gene typically found in a 
population known not to be suffering from CML. An increase or a decrease of the level of 

15 expression in the patient derived cell sample of the CML-associated genes indicates that the 
subject is suffering from or is at risk of developing CML. For example, an increase in 
expression of CML 1-190 in the test population compared to the normal control level indicates 
that the subject is suffering from or is at risk of developing CML . Conversely, a decrease m 
expression of CML 191-296 in the test population compared to the normal control level indicates 

20 that the subject is suffering from or is at risk of developing CML. 

When one or more of the CML-associated genes are altered in the test population 
compared to the normal control level indicates that the subject suffers from or is at risk of 
developmg CML. For example, at least 1%, 5%, 25%, 50%, 60%, 80%, 90% or more of the . 
panel of CML-associated genes (CML 1-190, CML 191-296, or CML 1-296) are altered. 

25 

Identifying Agents that inhibit or enhance CML-associated gene expression 

An agent that inhibits the expression or activity of a CML-associated gene is identified 
by contacting a test cell population expressing a CML associated up-regulated gene with a test 
agent and determining the expression level of the CML associated gene. A decrease in 
30 expression in the presence of the agent compared to the normal control level (or compared to the 
level in the absence of the test agent) indicates the agent is an inhibitor of a CML associated up- 
regulated gene and usefril to inhibit CML. 
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Altematively, an agent that enhances the expression or activity of a CML down-regulated 
associated gene is identified by contacting a test cell population expressing an CML associated 
gene with a test agent and determining the esqpression level or activily of the CML associated 
down-regulated gene. An increase of e?q)ression or activity compared to a normal control level 
5 or activity of the CML-associated gene indicates that the test agent augments expression or 
activity of the down-regulated CML associated gene. 

The test cell population is any cell e>q>ressing tihe CML-associated genes. For example, 
the test cell population contains a mononuclear cell, such a cell is isolated fiom peripheral blood. 
Alternatively, the test cell is a cell, vMch has been transfected with a CML associated gene or 
10 which has been transfected with a regulatory sequence (e.g. promoter sequence) from a CML- 
associated gene operably linked to a reporter gene. 

Assessing efficacy of treatment of CML in a subject 

The differentially expressed CML sequences identified herein also allow for the course of 

15 treatment of CML to be monitored, fii this method, a test cell population is provided from a 
subject undergoing treatment for CML. If desired, test cell populations are obtained from the 
subject at various time points before, during, or after treatment Expression of one or more of 
the CML sequences, m the cell population is then determined and compared to a reference cell 
population which includes cells whose CML state is known. The reference cells have not been 

20 exposed to the treatment. 

If the reference cell population contains no CML cells, a similarity in expression between 
CML sequences in the test cell population and the reference cell population indicates that the 
treatment is efficacious. However, a difference in expression between CML sequences in the 
test population and a normal control reference cell population indicates a less favorable clinical 

25 outcome or progQOsis. 

By "efficacious'* is meant that the treatment leads to a reduction in expression of a 
pathologically upregulated gene, increase in expression of a patholo^cally down-regulated gene 
or a decrease leukemic stem cells and their dividing progeny (ie., granulocytic, eryttiroid, and 
megakaryocytic precursors) in a subject. When treatment is applied prophylactically, 

30 "efficacious" means that the treatment retards or prevents a symptom of clinical CML. For 

example, the treatment inhibits a symptom of chronic, acute, or accelerated phase. Assesment of 
CML is made using standard clinical protocols. 
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EfBcaciousness is determined in association with any known method for diagnosing or 
treating CML. CML is diagnosed for example, by identifying symptomatic anomalies, e.g., 
anemia, hypermetabolism, easy fatigability, weakness, weight loss, and anorexia- Other 
characteristics of CML include splenomegaly, thrombocytosis and an almost total lack of 

5 alkaline phosphatase in granulocytes. Patients also exhibit marked elevation of the leukocyte 
count with the chculating cells bemg predominantly neutrophils and metamyelocytes, but 
basophils and eosinophils may also be prominent Furthermore, the Ph^ (Philadelphia) 
chromosome is present in the dividing progeny of multipotent myeloid stem cells (ie., 
granulocytic, erythroid, and megakaryocytic precursors) and lymphoid cells (ie,, B cells) of 

10 approximately 90% of patients with CML. 

Selecting a therapeutic agent for treating CML that is appropriate for a particular individual 

Differences in the genetic makeup of individuals can result in differences in their relative 
abilities to metabolize various drugs. An agent that is metabolized in a subject to act as an 

15 anti-CML agent can manifest itself by inducing a change in gene expression partem in the 

subject's cells from that characteristic of a CML state to a gene expression pattern characteristic 
of a non-CML state. Accordingly, the differentially expressed CML sequences disclosed herem 
allow for a putative therapeutic or prophylactic inhibitor of CML to be tested in a test cell 
population from a selected subject in order to determine if the agent is a suitable anti-CML agent 

20 in the subject. 

To identify an inhibitor or enhancer of CML, that is appropriate for a specific subject, a 
test cell population from the subject is exposed to a therapeutic agent, and the expression of one 
or more of CML 1-296 sequences is determined. 

The test cell popiilation contains an CML cell expressing an CML associated gene. 
25 Preferably, the test cell is a mononuclear cell. For example a test cell population is incubated in 
the pr^ence of a candidate agent and the pattern of gene expression of the test sample is 
measured and compared to one or more reference profiles, e.g., a CML reference expression 
profile or an non-CML reference expression profile. 

A decrease in expression of one or more of the sequences CML 1-190 or an increase in 
30 expression of one or more of the sequences CML 191-296 in a test cell population relative to a 
reference cell population containing CML is indicative that the agent is therapeutic. 
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The test agent can be any compound or composition. For example, the test agents are 
agents that regulate growth and differentiation of hematopoietic precursors, . 

Screening assays for identifying thercpeutic agents 
5 The differentially expressed sequences disclosed herein can also be used to identify 

candidate therapeutic agents for treating a CML. The method is based on screening a candidate 
therapeutic agent to determine if it converts an expression profile of CML 1-296 sequences 
characteristic of a CML state to a pattem indicative of a non-CML state. 

In the method, a cell is exposed to a test agent or a combmation of test agents 

10 (sequentially or consequentially) and the expression of one or more CML 1-296 sequences in the 
cell is measured. The expression profile of the CML sequences in the test population is 
compared to expression level of the CML sequences in a reference cell population that is not 
exposed to the test agent 

An agent effective in stimulating expression of underexpressed genes, or in suppressing 

15 expression of overexpressed genes is deemed to lead to a clinical benefit such compounds are 
further tested for the ability to prevent an inareased myeloid stem cell mass or to prevent 
maturation of leukemic stem cells (ie., pluripotent hematopoietic stem cells), in animals or test 
subjects. 

In a further embodiment, the present invention provides methods for screening candidate . 

20 agents which are potential targets in the treatment of CML. As discussed ui detail above, by 
controlling the expression levels or activities of marker genes, one can control the onset and 
progression of CML. Thus, candidate agents, vAdch are potential targets in the treatment of 
CML, can be identified through screenings that use the expression levels and activities of marker 
genes as indices. In the context of the present invention, such screening may comprise, for 

25 example, the following steps: 

a) contacting a test compound with a polypeptide encoded by CML 1-296; 

b) detecting the binding activity between the polypeptide and tiie test compound; and 

c) selecting a compound that binds to the polypeptide 

Alternatively, the screening method of the present invention may comprise the following 

30 Steps: 

a) contacting a candidate compound with a cell expressing one or more marker genes, 
wherein the one or more marker genes is selected firom the group consisting of CML 1- 
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296; and 

b) selecting a compound that reduces the e3q>ression level of one or more marker 
genes selected fix)m tiie group consisting of CML 1-190, or elevates the expression level of 
one or more marker genes selected from the group consisting of CML 191-296. 

Cells e:q>ressing a marker gene include, for example, cell lines established from CML; such cells 

can be used for the above screening of the present invention. 

Alternatively, the screening method of the present invention may comprise the following 

steps: 

a) contacting a test compoxind with a polypeptide encoded by selected from the group 

consisting of CML 1 -296; 

b) detecting the biological activity of the polypeptide of step (a); and 

c) selecting a compound that suppresses tibte biological activity of the polypeptide encoded 

by CML 1-190 in comparison with the biological activity detected in the absence of the 
test compound, or enhances the biological activity of the polypeptide encoded by CML 
191-296 in comparison with the biological activity detected in the absence of the test 
compoimd. 

A protein required for the screening can be obtained as a recombinant protein using the 
nucleotide sequence of the marker gene. Based on the information of the marker gene, one 
skilled in the art can select any biological activity of the protein as an index for screening and a 
measurement method based on the selected biological activity. 

Alternatively, the screening method of the present invention may comprise the following 

steps: 

a) contacting a candidate compound with a cell into which a vector comprismg the 
transcriptional regulatory region of one or more marker genes and a reporter gene that 
is expressed under the control of the transcriptional regulatory region has been 
introduced, wherein the one or more marker genes are selected from the group 
consisting of CML 1-296 

b) measuring the activity of said reporter gene; and 

c) selecting a compound that reduces the expression level of said reporter gene when said 
marker gene is an up-regulated marker gene selected from the group consisting of 
CML 1-190 or that enhances the expression level of said reporter gene when said 
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marker gene is a down-regulated marker gene selected from the group consisting of 
CML 191-296, as compared to a control. 
Suitable reporter genes and host cells are well known in the art The reporter construct 
required for the screening can be prepared by using the transcriptional regulatory region df a 
5 marker gene. When the transcriptional regulatory region of a marker gene has been known to 
those skilled in the art, a reporter construct can be prepared by using the previous sequence 
mforaiation* When the transcriptional regulatory region of a marker gene remains unidentified, a 
nucleotide segment containing the transcriptional regulatory region can be isolated from a 
genome library based on the nucleotide sequence information of the marker gene. 
10 The compound isolated by the screening is a candidate for drugs that inhibit the activity 

of the protein encoded by marker genes and can be applied to the treatment or prevention of 
CML. 

Moreover, compound in which a part of the structure of the compound inhibiting the 
activity of proteins encoded by marker genes is converted by addition, deletion and/or 
15 replacement are also included in the compounds obtainable by the screening method of the 
present invention. 

When administrating the compound isolated by the method of the invention as a 
phamiaceutical for humans and other mammals, such as mice, rats, guinea-pigs, rabbits, chicken, 
cats, dogs, sheep, pigs, cattle, monkeys, baboons, and chimpanzees, the isolated compound can 

20 be directly administered or can be formulated into a dosage form using known pharmaceutical 
preparation methods. For example, according to the need, the drugs can be taken orally, as 
sugar-coated tablets, capsules, elixirs and microcapsules, or non-orally, in the form of injections 
of sterile solutions or suspensions with water or any other pharmaceutically acceptable liquid. 
For example, the compounds can be mixed with pharmaceutically acceptable carriers or media, 

25 specifically, sterilized ^ter, physiological saline, plant-oils, emulsifiers, suspending agents, 
surfactants, stabilizers, flavoring agents, excipients, vehicles, preservatives, binders, and such, in 
a unit dose form required for generally accepted drug implementation. The amount of active 
ingredients m these preparations makes a suitable dosage within the indicated range acquirable. 
Examples of additives that can be mixed to tablets and c^sules are, binders such as 

30 gelatin, com starch, tragacanth gum and arable gum; excipients such as crystalline cellulose; 
swelling agents such as com starch, gelatin and alginic acid; lubricants such as magnesium 
stearate; sweeteners such as sucrose, lactose or saccharin; and flavoring agents such as 
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peppennint, Gaultheria adenothrix oil and cherry. When the unit-dose form is a capsule, a liquid 
carrier, such as an oil, can also be further hicluded in the above ingredients. Sterile composites 
for injections can be formulated following normal drug implementations using vehicles such as 
distilled water used for injections. 

5 Physiological saline, glucose, and other isotonic liquids including adjuvants, such as D- 

sorbitol, D-mannnose, D-mannitol, and sodium chloride, can be used as aqueous solutions for 
injections.. These can be used in conjunction with suitable solubilizers, such as alcohol, 
specifically ethanpl, polyalcohols such as propylene glycol and polyethylene glycol, non-ionic 
surfactants, such as Polysorbate 80 (TM) and HCO-50. 

10 Sesame oil or Soy-bean oil can be used as a oleaginous liquid and may be used in 

conjxinction with benzyl benzoate or benzyl alcohol as a solubilizer and may be formulated with 
a buffer, such as phosphate buffer and sodium acetate buffer, a pain-killer, such as procaine 
hydrochloride; a stabilizer, such as benzyl alcohol and phenol; and an anti-oxidant The 
prepared mjection may be filled into a suitable ampule. 

15 Methods well known to one skilled in the art may be used to admmister the 

pharmaceutical composition of the present inevntion to patients, for example as intraarterial, 
intravenous, or percutaneous injections and also as intranasal, transbronchial, intramuscular or 
ordl admizustrations. The dosage and method of administration vary according to the body- 
weight and age of a patient and the administration method; however, one skilled in the art can 

20 routinely select a suitable metod of administration* If said compound is encodable by a DNA, 
the DNA can be inserted into a vector for gene therapy and the vector administered to a patient 
to perform the therapy. The dosage and method of administration vary according to the body- 
weight, age, and symptoms of the patient but one skilled in the art can suitably select them. 
For example, although the dose of a compound that binds to the protem of the present 

25 invention and regulates its activity depends on the symptoms, the dose is about 0.1 mg to about 
100 mg per day, preferably about 1.0 mg to about 50 mg per day and more preferably about 1 .0 
mg to about 20 mg per ilay, when administered orally to a normal adult (weight 60 kg). 

When administering parenterally, in the form of an injection to a normal adult (weight 60 
kg), although there are some differences according to the patient, target organ, symptoms and 

30 method of administration, it is convenient to intravenously inject a dose of about 0.01 mg to 
about 30 mg per day, preferably about 0.1 to about 20 mg per day and more preferably about 0.1 
to about 10 mg per day. Also, in the case of other animals too, it is possible to administer an 
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amoimt converted to 60 kgs of body-weight. 

Assessing the prognosis of a subject with CML 

Also provided is a method of assessing the prognosis of a subject with CML by 
comparing the expression of one or more CML sequences in a test cell population to the 
e3q)ression of the sequences in a reference cell population derived fix)m patients over a spectrum 
of disease stages. By comparing gene expression of one or more CML sequences in the test cell 
population and the reference cell populatIon(s), or by comparing the pattem of gene expression 
over time in test cell populations derived jSrom the subject, the prognosis of the subject can be 
assessed 

A decrease in expression of one or more of the sequences CML 191-296 compared to a 
normal control or an increase of expression of one or more of the sequences CML 1-190 
compared to a normal control indicates less favorable prognosis. An increase in expression of 
one or more of the sequences CML 191-296 mdicates a more favorable prognosis, and a 
decrease in e^^ression of sequences CML 1-190 indicates a more favorable prognosis for tiie 
subject 

Kits 

The invention also includes a CML-detection reagent, e.g., a nucleic acid that specifically 
20 binds to or identifies one or more CML nucleic acids such as oligonucleotide sequences, which 
. are complementary to a portion of a CML nucleic acid or antibodies v^ch bind to protems 
encoded by a CML nucleic acid The reagents are packaged together in the form of a kit The 
reagents are packaged in separate containers, e.g., a nucleic acid or antibody (either bound to a 
solid matrix or packaged separately with reagents for binding them to the matrix) , a control 
25 reagent (positive and/or negative), and/or a detectable label. Instructions (e.g., written, tape, 
VCR, CD-ROM, etc.) for carrying out the assay are included in the kit Hie assay format of the 
kit is a Northern hybridization or a sandwich ELIS A known in the art 

For example, CML detection reagent, is immobilized on a solid matrix such as a porous 
strip to form at least one CML detection site. The measurement or detection region of the porous 
30 strip may include a plurality of sites containing a nucleic acid. A test strip may also contain sites 
for negative and/or positive controls. Alternatively, control sites are located on a separate strip 
firom the test strip. Optionally, the dilBferent detection sites may contain different amounts of 
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immobilized nucleic acids, ie., a higher amount in the first detection site and lesser amounts in 
subsequent sites. Upon the addition of test sample, the number of sites displaying a detectable 
signal provides a quantitative indication of the amount of CML present in the sample. The 
detection sites may be configured in any suitably detectable shape and are typically in the shape 
of a bar or dot spanning the width of a teststrip. 

Alternatively, the kit contains a nucleic acid substrate array comprisiog one or more 
nucleic acid sequences. The nucleic acids on flie array specifically identify one or more nucleic 
acid sequences represented by CML 1-296. The expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 
40 or 50 or more of flie sequences represented by CML 1-296 are identified by virtue of the level 
of bindmg to an array test strip or chip. The substrate array can be on, e.g., a solid substrate, e.g., 
a "chip" as described m U.S. Patent No.5,744305. 

Arrays and pluralities 

The invention also includes a nucleic acid substiate array comprising one or more nucleic 
acid sequences. The nucleic acids on the array specifically correspond to one or more nucleic 
acid sequences represented by CML 1-296. The level of expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 
15, 20, 25, 40 or 50 or more of the sequences represented by CML 1-296 are identified by 
detecting nucleic acid binding to the army. 

The invention also includes an isolated plurality (ie., a mixture of two or more nucleic 
acids) of nucleic acid sequences. The nucleic acid sequences are in a liquid phase or a solid 
phase, e.g., unmobilized on a solid support such as a nitrocellulose membrane. The plurality 
includes one or more of the nucleic acid sequences represented by CML 1-296. In various 
embodiments, the pluraUty includes 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 40 or 50 or more of the 
sequences represented by CML 1-296. 

Methods of inhibiting CML 

The invention provides a mefliod for treating or alleviating a symptom of CML in a 
subject by decreasing expression or activity of CML 1-190 or increasmg expression or activity of 
CML 191-296. Ther^eutic compounds are admmistered prophylactically or therapeutically to a 
subject suffering from or at risk of (or susceptible to) developing CML. Such subjects are 
identified using standard clinical methods or by detecting an aberrant level of expression or 
activity of (e.g., CML 1-296). Therapeutic agents include mhibitors of cell cycle regulation, cell 
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proliferation> and protein kinase activity. Preferably, the inliibitor of kinase activity is not 
STI571. Alternatively, STI571 is administered together with one or more of the inhibitors of 
CML 1-296. 

The therapeutic method includes increasing the expression, or function, or both of one or 

5 more gene products of genes whose e>q>ression is decreased Cunderexpressed genes") in a CML 
cell relative to normal cells of the same cell type fcom which the CML cells are derived. In 
diese mefliods, the subject is treated with an effective amount of a compound, vAAch increases 
the .amount of one of more of the underexpressed genes in the subject. Administration can be 
systemic or local. Therapeutic compounds include a polypeptide product of an imderexpressed 

10 gene, or a biologically active fragment thereof, a nucleic acid encoding an imderexpressed gene 
and having expression control elements permitting expression in the CML cells; for example an 
agent which increases the level of expression of such gene endogenous to the CML cells (i.e., 
which up-regulates expression of the underexpressed gene or genes). Administration of such 
compounds counter the effects of aberrantly-under expressed of the gene or genes in the 

15 subject's hematopoietic cells and improves the clinical condition of the subject 

The method also includes decreasing the expression, or function, or both, of one or more 
gene products of genes whose expression is aberrantly increased ("overe3q)ressed gene'*) in 
hematopoietic cells including hematopoietic stem cells. Expression is inhibited hi any of several 
ways known in the art For example, expression is inhibited by administering to the subject a 

20 nucleic acid that inhibits, or antagonizes, the expression of the overexpressed gene or genes, e.g., 
an antisense oligonucleotide or small interfering RNA which disrupts expression of the 
overexpressed gene or genes. 

As noted above, antisense nucleic acids corresponding to the nucleotide sequence of 
CML 1-190 can be used to reduce the expression level of the CML 1-190. Antisense nucleic 

25 acids corresponding to CML 1-190 that are up-regulated in CML are useful for the treatment of 
CML. Specifically, the antisense nucleic acids of the present invention may act by binding to the 
CML 1-190 or mRNAs corresponding thereto, thereby mhibiting the transcription or translation 
of the genes, promoting the degradation of the mRNAs, and/or inhibiting the expression of 
proteins encoded by the CML 1-190, finally hihibitmg the function of the protems . The term 

30 "antisense nucleic acids" as used herem encompasses both nucleotides that are entirely 

complementary to the target sequence and those having a mismatch of one or more nucleotides, 
so long as the antisense nucleic acids can specifically hybridize to the target sequences. For 
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example, the antisense nucleic acids of the present invention include polynucleotides that have a 
homology of at least 70% or higher, preferably at 80% or higher, more preferably 90% or higher, 
even more preferably 95% or higher over a span of at least 15 continuous 'nucleotides. 
Algorithms kaovm in the art can be used to determine the homology* 
5 The antisense nucleic acid derivatives of the present invention act on cells producing the 

protems encoded by marker genes by binding to the DNAs or mKNAs encoding the proteins, 
inhibiting their transcription or translation, promoting the degradation of the mRNAs, and 
inhibiting the e:xpression of the proteins, thereby resulting in the inhibition of the protein 
fiinction. 

10 An antisense nucleic acid derivative of the present invention can be made into an extemal 

preparation, such as a liniment or a poultice, by mixing with a suitable base material which is 
inactive against the derivative. 

Also, as needed, the derivatives can be formulated into tablets, powders, granules, 
capsules, liposome capsules, injections, solutions, nose-drops and freeze-drying agents by adding 
15 excipients, isotonic agents, solubilizers, stabilizers, preservatives, pain-killers, and such. These 
can be prepared by following known methods. 

The antisense nucleic acids derivative is given to the patient by directly ^plying onto the 
ailing site or by injecting into a blood vessel so that it will reach the site of aihnent An 
antisense-mounting medium can also be used to increase durability and membrane-pemieability. 
20 Examples are, liposomes, poly-L-lysine, lipids, cholesterol, lipofectin or derivatives of these. 
The dosage of the antisense nucleic acid derivative of the present invention can be 
adjusted suitably according to the patient's condition and used in desired amounts. For example, 
a dose range of 0.1 to 100 mg/kg, preferably 0.1 to 50 mg/kg can be administered. 

The antisense nucleic acids of the invention inhibit the expression of the protein of the 
25 invention and is thereby useful for siq>pressing the biological activity of a protein of the 
invention. Also, expression-inhibitors, comprising the antisense nucleic acids of the invention, 
are useful since they can inhibit the biological activity of a protein of the invention. 

The antisense nucleic acids of present invention include modified oligonucleotides. For 
example, boated nucleotides may be used to confer nuclease resistance to an oligonucleotide. 
30 Also, an siKNA against marker gene can be used to reduce the expression level of the 

marker gene. By the term "siRNA" is meant a double stranded RNA molecule which prevents 
translation of a target mRNA. Standard techniques of introducing siRNA into the cell are used. 
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including those in which DNA is a template fix>m which RNA is transcribed. In the context of 
the present invention, the siRNA comprises a sense nucleic acid sequence and an anti-sense 
nucleic acid sequence against an upregulated marker gene, such as CML 1-190. The siKNA is 
constructed such that a single transcript has both the sense and complementary antisense 
5 sequences £rom tiie target gene, e.g., a hairpin. 

The method is used to alter the expression in a cell of an upregulated, e.g., as a result of 
malignant transformation of the cells. Binding of the siRNA to a transcript corresponding to one 
of die CML 1-190 in the target cell results in a reduction in the protein production by the ceU. 
The length of the oligonucleotide is at least 10 nucleotides and may be as long as the naturally- 
10 occurring the transcript. Preferably, the oligonucleotide is 19-25 nucleotides in length* Most 
preferably, the oligonucleotide is less than 75, 50 , 25 nucleotides in length. 

The nucleotide sequence of the siRNAs were designed usiag an siRNA design computer 
program available from the Ambion website (ht^://www.ambion.coni/techlib/ 
misc/siRNA_finder Jitml). The computer program selects nucleotide sequences for siRNA 
15 synthesis based on the following protocol. 

Selection of siRNA Target Sites: 

1 . Beginning with the AUG start codon of the object transcript, scan downstream for AA 
dinucleotide sequences. Record the occurrence of each AA and the 3' adjacent 19 
nucleotides as potential siRNA target sites. Tuschl, et al. recommend against designing 

20 siRNA to the 5' and 3* untranslated regions (UTRs) and regions near the start codon (within 
75 bases) as these may be richer in regulatory protein binding sites. UTR-binding proteins 
and/or translation initiation complexes may interfere with binding of the siRNA 
endonuclease complex. 

2. Compare the potential target sites to the human genome database and eliminate from 

25 consideration any target sequences with significant homology to other coding sequences. 
The homology search can be performed using BLAST, vMch can be found on the NCBI 
server at: wwwjicbijQlm.nih.gov/BLAST/ 

3. Select qualifying target sequences for synthesis. At Ambion, preferably several target 
sequences can be selected along the length of the gene for evaluation. 

30 The antisense oligonucleotide or siRNA of the invention inhibit the expression of the 

polypeptide of the invention and is thereby useful for suppressing the biological activity of the 
polypeptide of the invention. Also, expression-inhibitors, comprising the antisense 
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oligonucleotide or siRNA of the invention, are useftil in the point that they can inhibit the 
biological activity of the polypeptide of the invention. Therefore, a composition comprising the 
antisense oligonucleotide or siRNA of the present invention are useful in treating a CML. 

Alternatively, function of one or more gene products of the overexpressed genes is 

S inhibited by administering a cbmpound that binds to or otherwise inhibits tiie function of the 
gene products. For example, the compound is an antibody which binds to the overexpressed 
gene product or gene products. 

The present invention refers to the use of antibodies, particularly antibodies against a 
protein encoded by an up-regulated marker gene, or a fragment of the antibody. As used herein, 

10 the term "antibody refers to an immunoglobulin molecule having a specific structure, that 
interacts (i.e., binds) only with the antigen that was used for synthesizing the antibody (i.e., the 
up-regulated marker gene product) or with an antigen closely related to it Furthermore, an 
antibody may be a j&agment of an antibody or a modified antibody, so long as it binds to one or 
more of the proteins encoded by the maricer genes. For instance, the antibody firagment may be 

15 Fab, F(ab')2, Fv, or single chain Fv (scFv), in which Fv fragments from H and L chains are 
ligated by an appropriate linker (Huston J. S. et al. Proc. Natl. Acad. Sci. U.S A. 85:5879-5883 
(1988)). More specifically, an antibody fragment may be generated by treating an antibody with 
an enzyme, such as papain or pepsin. Altematively, a gene encoding the antibody fragment may 
be constructed, inserted into an expression vector, and expressed in an appropriate host cell (see, 

20 for example, Co M. S. et al. J. Immunol. 152:2968-2976 (1994); Better M. and Horwitz A. H. 
Methods Enzymol. 178:476-496 (1989); Pluckthun A. and Skeira A. Methods Enzymol. 
178:497-515 (1989); Lamoyi E. Methods Enzymol. 121:652-663 (1986); Rousseaux J. et al. 
Methods Enzymol. 121:663-669 (1986); Bird R. E. and Walker B. W. Trends Biotechnol. 9:132- 
137(1991)). 

25 An antibody may be modified by conjugation with a variety of molecules, such as 

polyethylene glycol CPEG). The present invention provides such modified antibodies. The 
modified antibody can be obtained by chemically modifying an antibody. These modification 
methods are conventional in the field. 

Altematively, an antibody may be obtained as a chimeric antibody, between a variable 

30 region derived from a nonhuman antibody and a constant region derived from a human antibody, 
or as a humanized antibody, comprising the complementarity determining region (CDR) derived 
from a nonhuman antibody, the frame work region (FR) derived from a human antibody, and the 
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constant region. Such antibodies can be prepared by using known technologies. 

Cancer therapies directed at specific molecular alterations that occur in cancer cells have 
been validated through clinical development and regulatory approval of anti-cancer drugs such 
as trastuzumab (Herceptin) for the treatment of advanced breast cancer, imatinib methylate 

5 (Gleevec) for chronic myeloid leukemia, gefitinib (Lressa) for non-small cell lung cancer 
(NSCLC), and rituximab (anti-CD20 mAb) for B-cell lymphoma and mantle cell lymphoma 
(Ciardiello F, Tortora G. A novel approach m the treatment of cancer: targeting the epidermal 
growth factor receptor. Clin Cancer Res. 2001 Oct;7(10):2958-70. Review.; Slamon DJ, 
Leyland-Jones B, Shak S, Fuchs H, Paton V, Bajamonde A, Fleming T, Eiermann W, Wolter J, 

10 Pegram M, Baselga J, Norton L. Use of chemotherapy plus a monoclonal antibody against HER2 
for metastatic breast cancer that overexpresses HER2. N Engl J Med. 2001 Mar 15;344(1 1):783- 
92.; Rehwald U, Schulz H, Reiser M, Sieber M, Staak JO, Morschhauser F, Driessen C, Rudiger 
T, Muller-Hemielmk K, Diehl V, Engert A. Treatment of relapsed CD20^ Hodgkin lymphoma 
with the monoclonal antibody rituximab is effective and well tolerated: results of a phase 2 trial 

15 of the German Hodgkin Lymphoma Study Group. Blood. 2003 Jan 15;101(2):420-424.; Fang G, 
Kim CN, Perkins CL, Ramadevi N, Wmton E, Wittmann S and Bhalla KN. (2000). Blood, 96, 
2246-2253.). These drugs are clinically effective and better tolerated than traditional anti-cancer 
agents because they target only transformed cells. Hence, such drugs not only improve survival 
and quality of life for cancer patients, but also validate the concept of molecularly targeted 

20 cancer therapy. Furthermore, targeted drugs can enhance the efficacy of standard chemotherapy 
' when used in combination with it (Gianni L. (2002). Oncology, 63 Suppl 1, 47-56.; Klejman A, 
Rushen L, Morrione A, Slupianek A and Skorski T. (2002). Oncogene, 21, 5868-5876.). 
Therefore, future cancer treatments will probably involve combining conventional drugs with 
target-specific agents aimed at different chamcteristics of tumor cells such as angiogenesis and 

25 invasiveness. 

These modulatory methods are performed ex v/va or in vitro (e.g., by culturing the cell 
with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). The 
method involves administering a protein or combination of proteins or a nucleic acid molecule or 
combination of nucleic acid molecules as thempy to counteract aberrant expression or activity of 
30 the differentially expressed genes. 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering fix>m the disease or disorder) levels or biological activity of the genes may be treated 
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with therapeutics that antagonize (/.a, reduce or inhibit) activity of the overexpressed gene or 
genes. Therapeutics that antagonize activity are administered therapeutically or prophylactically. 

Therapeutics that may be utilized include, e.g., (i) a polypeptide, or analogs, derivatives, 
j&agments or homologs thereof of the overexpressed or underexpressed sequence or sequences; 
5 (it) antibodies to flie overexpressed or underexpressed seqxience or sequences; (Hi) nucleic acids 
encoding the over or underexpressed sequence or sequences; (iv) antisense nucleic acids or 
nucleic acids that are "dysfunctional" (Le., due to a heterologous insertion within the coding 
sequences of coding sequences of one or more overexpressed or undere?q>ressed sequences); (v) 
small interfering RNA (siRNA); or (vi) modulators (i.e., inhibitors, agonists and antagonists that 

10 alter the interaction between an over/underexpressed polypeptide and its binding partner. The 
dysfunctional antisense molecules are utilized to "knockout" endogenous fiinction of a 
polypeptide by homologous recombination (see, e.g., Capecchi, Science 244: 1288-1292 1989). 
The term "siRNA" refers to a double stranded RNA molecule which prevents translation of a 
target mRNA. Standard techniques are used for introducing siRNA into cells, including those 

15 wherein DNA is used as the template to transcribe RNA. The siRNA comprises a sense nucleic 
acid sequence and an anti-sense nucleic acid sequence of the any one of the CML 1-190 gene. 
The siRNA is constructed such that a single transcript (double stranded RNA) has both the sense 
and complementary antisense sequences from the target gene, e.g., a hairpin. 

Diseases and disorders that are characterized by decreased (relative to a subject not 

20 suffering from the disease or disorder) levels or biological activity may be treated with 

therapeutics that increase (/.e., are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized 
include, but are not limited to, a polypeptide (or analogs, derivatives, fragments or homologs 
thereof) or an agonist that increases bioavailability. 

25 Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, 

by obtaining a patient cell sample and assaying it in vitro for RNA or peptide levels, structure 
and/or activity of the expressed peptides (or mRNAs of a gene whose e?q)ression is altered). 
Methods that are well-known within the art include, but are not limited to, immunoassays (e.g., 
by Westem blot analysis, immunoprecipitation followed by sodium dodecyl sulfete (SDS) 

30 polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to 
detect expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 
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Prophylactic administration occurs prior to the manifestation of overt clinical symptoms 
of disease, such that a disease or disorder is prevented or> alternatively, delayed in its progression. 

Therapeutic methods includes contacting a cell with an agent that modulates one or more 
of the activities of the gene products of the differentially expressed genes. An agent that 
modulates protein activity includes a nucleic acid or a protein, a naturally-occurring cognate 
ligand of these proteins, a peptide, a peptidomimetic, or other small molecule. For example, the 
agent stimulates one or more protem activities of one or more of a differentially under-e3q)ressed 
gene. 

The present invention also relates to a method of treating or preventing CML in a 
subject comprising administering to said subject a vaccine comprising a polypeptide encoded by 
a nucleic acid selected from the group consisting of CML 1-190 or an immimologically active 
fragment of said polypeptide, or a polynucleotide encoding the polypeptide or the fragment 
thereof An administration of the polypeptide induce an anti-tumor inmiunity in a subject To 
inducing anti-tumor immunity, a polypeptide encoded by a nucleic add selected from the group 
consisting of CML 1-190 or an immunologically active fragment of said polypeptide, or a 
polynucleotide encoding the polypeptide is administered. The polypeptide or the 
immunologically active fi-agments thereof are useful as vaccines against CML. In some cases 
the proteins or fragments thereof may be administered in a form bound to the T cell recepor 
(TCR) or presented by an antigen presenting cell (APC), such as macrophage, dendritic cell 
(DC), or B-cells. Due to the strong antigen presenting ability of DC, the use of DC is most 
preferable among the APCs. 

In the present invention, vaccine against CML refers to a substance that has the fimction 
to induce anti-tumor immunity upon inoculation into animals. According to the present 
invention, polypeptides encoded by CML 1-190 or fragments tiiereof were suggested to be HLA- 
A24 or HLA-A*0201 restricted epitopes peptides that may induce potent and specific umnune 
response gainst CML cells expressmg CML 1-190. Thus, the present invention also 
encompasses method of inducing anti-tumor immunity using the polypeptides. In general, anti- 
tumor immunity includes immune responses such as follows: 

- induction of cytotoxic lymphocytes against tumors, 

- induction of antibodies that recognize tumors, and 

- induction of anti-tumor cytokine production. 

Therefore, when a certain protein induces any one of these immune responses upon 
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inoculation into an animal, the protein is decided to have anti-tumor immimity inducing effect. 
The induction of the anti-tumor immunity by a protein can be detected by observing in vivo or in 
vitro the response of the immune system in the host against the protein. 

For example, a method for detecting the induction of cytotoxic T lymphocytes is well 
known. A foreign substance that enters the living body is presented to T cells and B cells by the 
action of antigen presenting cells (APCs). T cells that respond to the antigen presented by APC 
in antigen specific manner dififerentiate mto cytotoxic T cells (or cytotoxic T lymphocytes; 
CTLs) due to stimulation by the antigen, and then proliferate (this is referred to as activation of T 
cells). Therefore, CTL induction by a certain peptide can be evaluated by presenting the peptide 
to T cell by APC, and detecting the induction of CTL. Furfhermore, APC has the effect of 
activating CD4+ T cells, CD8+ T cells, macrophages, eosinophils, and NK ceils. Since CD4+ T 
cells and CD8+ T cells are also important m anti-tumor immunity, the anti-tumor immunity 
inducing action of the peptide can be evaluated using the activation effect of these cells as 
indicators. 

A method for evaluating tiie inducmg action of CTL usmg dendritic cells (DCs) as APC 
is well known in the art. DC is a representative APC having the strongest CTL inducing action 
among APCs. In this method, the test polypeptide is initially contacted with DC, and then this 
DC is contacted witii T cells. Detection of T cells having cytotoxic effects against the cells of 
interest after the contact with DC shows that the test polypeptide has an activity of inducing the 
Cytotoxic T cells. Activity of CTL against tumors can be detected, for example, using the lysis 
of 51Cr-labeled tumor cells as the indicator. Alternatively, the method of evaluating the degree 
of tumor ceil damage using 3H-thymidine uptake activity or LDH (lactose dehydrogenase)- 
release as the indicator is also well known. 

Apart from DC, peripheral blood mononuclear cells (PBMCs) may also be used as the 
APC. The induction of CTL is reported that the it can be enhanced by culturing PBMC in the 
presence of GM-CSF and IL-4. Similarly, CTL has been shown to be induced by culturing 
PBMC in tfie presence of keyhole limpet hemocyanin (KLH) and IL-7. 

The test polypeptides confirmed to possess CTL inducing activity by these methods are 
polypeptides having DC activation effect and subsequent CTL inducing activity. Therefore, 
polypeptides that induce CTL against tumor cells are useful as vaccines against tumors. 
Furthermore, APC that acquired the ability to induce CTL against tumors by contacting with the 
polypeptides are usefiil as vaccines against tumors. Furthermore, CTL that acquired cytotoxicity 
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due to presentation of the polypeptide antigens by APC can be also used as vaccines c^ainst 
tumors. Such therapeutic methods for tumors using anti-tumor immunity due to APC and CTL 
are referred to as cellular immunotherapy. 

Generally, when using a polypeptide for cellular immunothempy, eflBciency of the 
CTL-induction is known to increase by combining a plurality of polypeptides having difTerent 
structures and contacting them with DC. Therefore, M^en stimulating DC with protein 
fiagments, it is advantageous to use a mixture of multiple types of fragments. 

Alternatively, the induction of anti-tumor immunity by a polypeptide can be confirmed 
by observing the induction of antibody production against tumors. For example, when 
antibodies against a polypeptide are induced in a laboratory animal immunized with the 
polypeptide, and when growth of tumor cells is suppressed by those antibodies, the polypeptide 
can be determined to have an ability to induce anti-tumor immunity. 

Anti-tumor immunity is induced by administering the vaccine of this invention, and the 
mduction of anti-tumor immunity enables treatment and prevention of CML. Therapy against 
cancer or prevention of the onset of cancer includes any of the steps, such as inhibition of flie 
growth of cancerous cells, involution of cancer, and suppression of occurrence of cancer. 
Decrease in mortality of individuals having cancer, decrease of tumor markers in the blood, 
alleviation of detectable symptoms accompanying cancer, and such are also included in the 
therapy or prevention of cancer. Such therapeutic and preventive effects are preferably 
statistically significant. For example, in observation, at a significance level of 5% or less, 
wherein the therapeutic or preventive eJBfect of a vaccine against cell prolifemtive diseases is 
compared to a control without vaccine administmtion. For example, Student's t-test, the Mann- 
Whitney U-test, or ANOVA may be used for statistical analyses. 

The above-mentioned protein having immunological activity or a vector encoding the 
protein may be combined with an adjuvant An adjuvant refers to a compoimd that enhances the 
inmiune response against the protein when administered together (or successively) with the 
protein having immunological activity. Examples of adjuvants include cholera toxin, salmonella 
toxin, alum, and such, but are not limited thereto. Furthermore, the vaccine of this invention 
may be combined appropriately with a pharmaceutically acceptable carrier. Examples of such 
carriers are sterilized water, physiological saline, phosphate buffer, culture fluid, and such. 
Furthermore, the vaccine may contain as necessary, stabilizers, suspensions, preservatives, 
surfectants, and such. The vaccine is administered systenaically or locally. Vaccine 
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administration may be perfonned by single administration, or boosted by multiple 
administrations. 

When using APC or CTL as the vaccine of this invention, tumors can be treated or 
prevented, for example, by the ex vivo method More specifically, PBMCs of the subject 

5 receiving treatment or prevention are collected, the cells are contacted with the polypeptide ex 
vivo, and following the induction of APC or CTL, the cells may be administered to the subject 
APC can be also induced by introducing a vector encoding the polypeptide into PBMCs ex vivo. 
APC or CTL induced in vitro can be cloned prior to administratioiL By cloning and growing 
cells having high activity of damaging target cells, cellular immunothempy can be performed 

10 more effectively. Furthermore, APC and CTL isolated in this manner may be used for cellular 
immunotherapy not only against individuals from whom the cells are derived, but also against 
similar types of tumors from other individuals. 

Furthermore, a pharmaceutical composition for treating or preventing a cell proliferative 
disease, such as cancer, comprising a pharmacexitically effective amount of the polypeptide of 

15 the present invention is provided. The pharmaceutical composition may be used for raising anti 
tumor immunity. 

Pharmaceutical compositions for inhibiting CML 

Pharmaceutical formulations include those suitable for oral, rectal, nasal, topical 

20 (including buccal and sub-lingual), vaginal or parenteral (including iatramuscular, subK^utaneous 
and intmvenous) administration, or for administration by inhalation or insufflation. Preferably, 
admmistration is intravenoxxs. The formulations are optionally packaged in discrete dosage units 
Pharmaceutical formxilations suitable for oral administration include capsules, cachets or 
tablets, each containing a predetermined amount of the active ingredient Formulations also 

25 include powders, granules or solutions, suspensions or emulsions. The active ingredient os 
optionally administered as a bolus electuary or paste. Tablets and capsules for oral 
administration may contain conventional excipients such as binding agents, fillers, lubricants, 
disintegrant or wetting agents. A tablet may be made by compression or molding, optionally with 
one or more formulational ingredients. Compressed tablets may be prepared by compressmg in a 

30 suitable machine the active ingredients in a fi:ee-flowing form such as a powder or granules, 
optionally mixed with a binder, lubricant, inert diluent, lubricating, surface active or dispersing 
agent. Molded tablets may be made by molding in a suitable machine a mixture of the powdered 
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compound moistened with an inert liquid diluent The tablets may be coated according to . 
methods well known in the art. Oral fluid preparations may be in th^e form of, for example, 
aqueous or oily suspensions, solutions, emulsions, syrups or elixirs, or may be presented as a dry 
product for constitution with water or other suitable vehicle before use. Such liquid preparations 
5 may contain conventional additives such as suspending agents, emulsifying agents, non-aqueous 
vehicles (which may include edible oils), or preservatives* The tablets may optionally be 
formulated so as to provide slow or controlled release of the active mgredient therein. A 
package of tablets may contain one tablet to be taken on ech of tihe month. The formulation or 
dose of medicament varies with respect to the phase (chronic, accelerated, or blast crisis) of the 
10 CML. 

Formulations for parenteral administration include aqueous and non-aqueous sterile 
injection solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which 
render the formulation isotonic with the blood of the intended recipient; and aqueous and non- 
aqueous sterile suspensions i^ch may include suspending agents and thickening agents. The 

15 formulations may be presented ia unit dose or multi-dose containers, for example sealed 

ampoules and vials, and may be stored in a freeze-dried (lyophilized) condition requiring only 
the addition of the sterile liquid carrier, for example, saline, water-for-injection, immediately 
prior to use. Alternatively, the formulations may be presented for continuous infusion. 
Extemporaneous injection solutions and suspensions may be prepared from sterile powders, 

20 granules and tablets of the kind previously described. 

Formulations for rectal administration include suppositories with standard carriers such 
as cocoa butter or polyethylene glycol. Formulations for topical administration in the mouth, for 
example buccally or sublingually, include lozenges, which contain the active ingredient in a 
flavored base such as sucrose and acacia or tragacanth, and pastilles comprising the active 

25 ingredient in a base such as gelatin and glycerin or sucrose and acacia. For intra-nasal 
administration the compounds of the invention may be used as a liquid spray or dispersible 
powder or in the form of drops. Drops may be formulated with an aqueous or non-aqueous base 
also comprising one or more dispersiag agents, solubilizing agents or suspending agents. 
For administmtion by inhalation the compounds are conveniently delivered from an 

30 insufflator, nebulizer, pressurized packs or other convenient means of delivering an aerosol spray. 
Pressurized packs may comprise a siiitable propellant such as dichlorodifluoromethane, 
trichlorofluoromethane, dichiorotetrafluoroethane, carbon dioxide or other suitable gas. In the 
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case of a pressurized aerosol, the dosage unit may be determined by providing a valve to deliver 
a metered amount 

Alternatively, for administration by inhalation or insufflation, the compounds may take 
the form of a dry powder composition^ for example a powder mix of the compound and a 
5 suitable powder base such as lactose or starch. The powder composition may be presented in 
unit dosage form, in for example, capsules, cartridges, gelatin or blister packs from which the 
powder may be administered with the aid of an inhalator or insufQators. 

Other formulations include implantable devices and adhesive patches; which release a 
therapeutic agent 

10 When desired, the above described formulations, adapted to give sustained release of the 

active ingredient, may be employed. The pharmaceutical compositions may also contain other 
active ingredients such as antimicrobial agents, immimosuppressants or preservatives* 

It should be understood that in addition to the ingredients particxilarly mentioned above, 
the formulations of this invention may include other agents conventional in the art having regard 

15 to the type of formulation in question, for example, those suitable for oral administration may 
include flavoring agents. 

Preferred unit dosage formulations are those containing an effective dose, as recited 
below, or an appropriate fraction thereof, of the active ingredient 

For each of the aforementioned conditions, the compositions, e.g., polypeptides and 

20 organic compounds are administered orally or via injection at a dose of from about 0, 1 to about 
250 mg/kg per day. The dose range for adult humans is generally from about 5 mg to about 17.5 
g/day, preferably about 5 mg to about 10 g/day, and most preferably about 100 mg to about 3 
g/day. Tablets or other unit dosage forms of presentation provided in discrete units may 
conveniently contain an amount which is effective at such dosage or as a multiple of the same, 

25 for instance, units contaming about 5 mg to about 500 mg, usually from about 1 00 mg to about 
500 mg. 

The dose employed will depend upon a number of Actors, including the age and sex of 
the subject, the precise disorder being treated, and its severity. Also the route of administration 
may vary depending upon the condition and its severity. 
30 The mvention will be further described in the following examples, which do not limit the 

scope of the invention described in the claims. The following examples illustrate the 
identification and characterization of genes differentially expressed in CML cells. 
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Example 1: Preparation of Test Samples 

Samples obtained from diseased cells and nonnal cells, e.g,, mononuclear cells from 
peripheral blood, were evaluated to identify genes which are differently e3q>ressed in a disease 
5 state, e.g., CML. The assays were cam^ out as follows. 

Patients and samples 

Periidieral blood samples were obtained from 27 CML patients prior to treatment 
wi1hSTI571. EachpatientwasthenemoUedintoaphasellstiidyof STI571. To characterize 
10 CML cells, mRNA from 27 samples in which more than 65% of cells had been positive for 
the Ph chromosome prior to treatment by a FISH analysis detecting a bcr/abl fusion gene (13) 
were analyzed on a cDNA-microarray system. Of the 27, two cases were in accelerated phase 
and three cases were in blast crisis phase (Table 1). A mixture of mononuclear cells from 
peripheral blood from eleven healthy volunteers was used as a control. 

15 



Table 1 Clmicopathological features of patients examined 



Patient's 


Age(y) 


Sex 


Ph(+) 


Phase 


ID 






(%)a 




CML002 


71 


F 


78 


Chronic 


CML003 


66 


M 


69 


Chronic 


CML004 


55 


F 


75.5 


Chronic 


CML008 


61 


F 


75 


Chronic 


CML009 


68 


M 


80.5 


Chronic 


CMLOlO 


56 


M 


65.5 


Chronic 


CML013 


59 


F 


87 


Chronic 


CML014 


47 


M 


83.5 


Chronic 


CML015 


63 


F 


72 


Chronic 


CML018 


57 


M 


83.5 


Chronic 


CML019 


23 


M 


79 


Chronic 


CML021 


57 


M 


79.5 


Chronic 


CML022 


69 


M 


72.5 


Chronic 


CML023 


68 


F 


76 


Chronic 


CML025 


44 


M 


76 


Chronic 


CML027 


35 


M 


75.5 


Chronic 


CML029 


45 


F 


73 


Chronic 


CML030 


61 


M 


65.5 


Chronic 



wo 2004/031409 



PCT/JP2003/010256 



32 - 



CML033 
CML036 
CML047 
CML054 



56 
48 
32 
32 



M 
M 
F 
M 



66 
77 
85.5 
71 



Chronic 
Chronic 
Chronic 
Chronic 



RNA Preparatioa and T7-based RNA amplification 

Mononuclear cells were prepared using FicoU (Amersham Biosciences, Buckinghamshire, 

5 UK) and total RNA was extracted using TRIzol (Life Technologies, Inc., Grand Island, NY) 
according to the manufacturer's instructions. After treatment with DNase I (Nippon Gene, 
Tolqro, Japan), T7-based RNA amplification was carried out (14). Two rounds of amplification 
using 2 ^g of total RNA as starting material yielded 40-100 ^ig of amplified RNA (aRNA). For 
control samples ftom healthy volunteers, two rounds of T7-based RNA amplification was also 

10 performed to obtain a sufficient amount of aRNA* RNA amplified by this method accurately 
reflects the proportions m the origLoal RNA source, as confirmed by semi-quantitative reverse 
transcriptionr-polymerase chain reaction OR.T-PCR) experiments, in which data fiom the 
microarrays were consistent with results from RT-PCR whether total RNA or amplified aRNA 
was used as the template (14). 

15 

Preparation of the cDNA microarrav and hybridization 

A genome-wide cDNA microarray was fabricated with 23,040 cDNAs selected from the 
UniGene database (build #13 1) of the National Center for Biotechnology Information (NCBI). 
To obtain cDNAs for spotting on the glass slides, RT-PCR was performed for each gene as 

20 described previously. (15) The PCR products were spotted on type 7 glass slides (Amersham 
Biosciences) by Microarray Spotter Generation HI (Amersham Biosciences); 4,608 genes were 
spotted in duplicate on a single slide. Five different sets of slides (total 23,040 genes) were 
prepared, on each of which the same 52 housekeeping genes and two negative-control genes 
were spotted as well. Labeling, hybridization, washing, scanning, and quantification of signals 

25 were performed as described previously (14) except fliat all processes were carried out witii an 
Automated Slide Processor (15). 



Quantification of signals and data analvsis 
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The intensity of each hybridization signal was calculated photometrically using the 
ArrayVision computer program (Amersham Biosciences). Each slide contained 52 
housekeeping genes, and the Cy5/Cy3 ratio for each gene's expression was adjusted so that 
the averaged Cy5/Cy3 ratio of the panel of housekeeping genes was 1 .0. A cut-oflf value was 
5 assigned to each microarray slide, using variance analysis. If both Cy3 and Cy5 signal 

intensities were lower than the cut-off values, the expression level of the corresponding gene 
in that sample was assessed as absent. For other genes the Cy5/Cy3 ratio was calculated 
using raw data of each sample. 

10 Example 2: Identification of CML - associated genes 

The relative expression ratio of each gene (Cy5/Cy3 intensity ratio) was classified into 
one of four categories: (1) highly up-regulated (expression ratio more than 5.0 in more than 50% 
of the informative cases); (2) highly down-regulated (expression ratio less than 0.2 in more than 
50% of the informative cases); (3) low expression (expression ratio between 0.2 and 5.0 in more 

15 than 50% of the informative cases); and (4) not expressed (or slight expression but under the cut- 
off level for detection). These categories were used to detect a set of genes whose changes in 
expression ratios were common among samples as well as specific to a certain subgroup. To 
detect candidate genes that were commonly up- or down-regulated in CML cells, the overall 
expression patterns of 23,040 genes were screened to select genes with expression ratios of more 

20 tiian 5.0 or less than 0.2 that were present in more than 50% of chronic phase of the CML cases 
categorized as (1), (2), or (3). 

Identification of genes with clinicallv relevant expression patterns in CML cells 

The expression patterns of approximately 23,000 genes in CML cells were examined 
using cDNA microarray. Individual data was excluded when both Cy5 and Cy3 signals were 
25 under cut-off values. The computational analysis identified commonly highly up-regulated or 
down-regulated genes in CML cells; 190 genes revealed the expression ratio of >5.0 in more 
than 50% of informative cases and 106 genes showed the egression ratio of <0.2 in more than 
50% of informative cases as down-regulated genes. 

One hundred ninety genes were found to be highly up-regulated. The upregulated genes 
30 included genes encoding proteins involved in cell cycle regulation, growth promotion, and 
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transcriptional activation and those having protein kinase activity. Many of them were shown to 
be over-expressed in other carcinomas. For example, MYB, a transcriptional activator that 
causes acute leukemia and transforms only hematopoietic cells (16), was highly expressed in 
over 90 % of the chronic phase of CML cells. GATA-binding protein 2 (GATA2), also a 
5 transcriptional activator which regulates endotherin-1 gene expression in endothelial cells (17), 
was reported to be activated in 93% of acute myeloid leukemia (AML), 70% of acute 
lymphoblastic leukemia (ALL), and 83% of CML (18). In particular, 28 genes for example, 
ribonuclease RNase A family 3, (RNASE3), bactericidal/permeability-increasing protein (BPl), 
defensin alpha 1, myeloid-related sequence (DEFAl), aminolevulmate, delta-synthase 1 

10 (ALASl), elastase 2, neutrophil (ELA2), cathepsin G (CTSG), matrix metalloprotease 9 (MMP- 
9), haptoglobin-related protein (HPR), urokinase plasminogen activator, (UPLA), haptoglobin 
(HP), H3 histone family, member J (H3FJ), and hemoglobiit, zeta (HBZ) were overexpressed in 
all of the informative samples in this study (see Table 3). MMP-9, an enzyme to degrade 
collagen type IV, is thought to be associated with the transmigration and degradation of the 

15 extracellular matrix structures of tissue and blood vessels. The expression of MMP-9 was 

enhanced in mononuclear cells of CML patients (19). Moreover, primary human Ph+ cells were 
reported to secrete various angiogenesis factors including MMP-9 (20). Thxis, overexpression of 
MMP-9 might play an important role in the pathogenesis of CML. Furthermore, members in the 
hemoglobin family, for example, zeta (HBZ), beta (HBB), gamma G (HBG2), delta (HDD), and 

20 alpha 2 (HBA2) were overexpressed in more lhan 80% of the informative cases. In addition, 
haptoglobm (HP) and haptoglobm-related protein (HPR) also showed enlianced expression in all 
informative cases. Recent studies have shown that bcr/abl expression induced hemoglobin (Hb) 
production in HL-60/BCR-Abl cells or CML cells (21), (22). This suggests tiiat tiie 
constitutively activated tyrosine kinase bcr-abl enhanced survival and expansion of 

25 hematopoietic progenitor cells. 

One hundred six genes were foimd to be significantiy down-regulated m the chronic 
phase of CML {see Table 4). The genes of known function included the SH3-domain GRB2-line 
2 (SH3GL2), PCAF associated factor 65 beta (PAF65B), heparan sulfate 6-0-sulfotransferase 
(HS6ST), immunoglobulin heavy constant gamma 3 (IGHG3), heat shock 27kD protein 2 
30 (HSPB2), and prostaglandm D synthase gene (PTGDS) genes whose e>!pression was suppressed 
in more than 90% of informative cases. A number of transcriptional negative regulators like 
DNA-dependent protein kinase catalytic subunit-interacting protein 2 (KIP2) were also included. 
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KIP2 is a negative regulator of cell proliferation and arrests cells at the Gl phase. K1P2 was 
down-regulated in approximately 60% of the informative CML cases. Therefore, its down- 
regulation may confer continuous prolifemtive properties of leukemic cells. 

Some of the significantly down-regulated genes reflect the difference in the cell types, 
the lymphocyte-specific genes such as genes encoding immune components, e.g., 
immunoglobulins, and complement component 2 (C2) as well as the markers for lymphocytes 
such as CD7, CD3E, CD79A, CD3Z, CD6, CD4, and CD79B antigens, interferon regulatory 
factor 4 and interleukin 7 receptor ^7R) (see Table 4). The majority of the cells 

contained in the chronic phase of CML cells used in this study corresponded to blast cells of 
myeloid lineage. Although the peripheral white cell used as universal controls contained cells of 
both myeloid and lymphoid lineage, lymphocytes accounted for only a small population of the 
cells in the sample. This mi^t represent that the population of lymphocyte in CML patients 
decreased compared with blood of healthy individuals. 



TABLE3 UP-REGULATED GENES 



CML As ratio(%) 
sigmnent 1) 


infoimad ratio>5 
ve cases 2) 


Accession 
No. 


Symbol 


Gene name 


1 


100 


22 


22 


X16545 


RNASE3 


ribonuclease, RNase A family. 














3 (eosinophil cationic protein) 


2 


100 


22 


22 


J04739 


BPI 


bactericidal/penneability-incre 














asing protein 


3 


100 


22 


22 


M21130 


DEFAl 


defensin, alpha 1, myeloid-rela 














ted sequence 


4 


100 


22 


22 


X56351 


ALASl 


aminolevulinate, delta-, syntbA 
se 1 


5 


100 


21 


21 


M16117 


CTSG 


cathepsin G 


6 


100 


21 


21 


M34379 


ELA2 


elastase 2, neutrophil 


7 


100 


20 


20 


J05070 


MMP9 


matrix metalloproteinase 9 ^e 














ladnase B, 92kD gelatinase, 92 














kD type IV collagenase) 


8 


100 


20 


20 


K03431 


HPR 


haptoglobin-related protein 


9 


100 


20 


20 


F21002 




ESTs 


10 


100 


19 


19 


X17042 


PRGl 


proteoglycan 1, secretory gran 
ule 


11 


100 


19 


19 


M14502 


ARGl 


arginase, liver 


12 


100 


18 


18 


L06895 


MAD 


MAX dimerization protein 


13 


100 


18 


18 


M69199 


G0S2 


putative lymphocyte GO/Gl s 














witch gene 


14 


100 


17 


17 


K01763 


HP 


h^toglobin 
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15 


100 


16 


16 


ffi3213 




ESTs 


16 


100 


16 


16 


Z98744 


H3FJ 


H3 histone femily, member J 


17 


100 


15 


15 


M24173 


HBZ 


hemoglobin^ zeta 


IS 


100 


15 


15 


X02419 


PLAU 


plasminogen activator^ urokina 


19 


100 


15 


15 


H48537 




se 

ESTs 


20 


100 


15 


15 


AA191449 KIAA1254KIAA1254 protein 


21 


100 


15 


15 


AI022380 




ESTs 


22 


100 


15 


15 


T03595 




Homo sapiens cDNA FLJ1268 














8 fis, clone NT2RM4002534 


23 


100 


14 


14 


X57129 


H1F2 


HI histone family, member 2 


24 


100 


14 


14 


AA382564 




ESTs 


25 


100 


13 


13 


R26792 


GCL 


grancalcin 


26 


100 


12 


12 


AA815247 




EST 


27 


100 


11 


11 


M81637 


GCL 


grancalcin 


28 


100 


11 


11 


AA44o44y 




xiol 


29 


95.45 


22 


21 


M83202 


LTF 


lactotransferrin 


30 


95.45 


22 


21 


V00497 


HBB 


hemoglobin, beta 


31 


95.45 


22 


21 


U01317 


HBD 


hemoglobin, delta 


32 


95.24 


21 


20 


AA489915 


HBG2 


hemoglobin, gamma G 


33 


95 


20 


19 


AI023753 




ESTs 


34 


94.74 


19 


18 




T CVD 




35 


93.75 


16 


15 








36 


93.33 


15 


14 








37 


91.67 


12 


11 






XUiiiUlC NLimmaLin^ IluruiUIlC iC 














ceptor ? 


3S 


91.67 


12 


11 


AA398536 




ESTs 


39 


91.67 


12 


11 


AA843554 




ESTs 


40 


90 91 


22 


20 


S81914 


IER3 


inGonediate early response 3 


41 


90 91 


22 


20 


M27717 


CPA3 


carboxypeptidase A3 (mast eel 


42 


90.91 


22 


20 


U22376 


MYB 


1) 

v-myb avian myeloblastosis vi 














ral oncogene homolog 


43 


90.91 


11 


10 


U32315 


SIXSA 


syntaxin 3 A 


44 


90.91 


11 


10 


AA774546 


NXF3 


nuclear RNA export factor 3 


45 


90.48 


21 


19 


AI015633 




Solute carrier family 26, mem 














ber8 


46 


89.47 


19 


17 


M33987 


CAl 


carbonic anhydrase I 


47 


88.89 


18 


16 


AA004412 




ESTs 


48 


88.89 


18 


16 


AI056326 




ESTs 


49 


88.24 


17 


15 


AA825819 LOC55871 COB W-like protein 


50 


86.67 


15 


13 


X65614 


SIOOP 


SIOO. calcium-binding protein 



p 
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51 


86.67 


15 


13 


L27711 


CDKN3 cyclin-dependent kinase inhibi 












tor 3 (CDK2-associated dual s 












pecificity phosphatase) 


52 


86.67 


15 


13 


AA418448 


ESTs 


53 


86.67 


15 


13 


AA235222 LOC51053 geminin 


54 


86.36 


22 


19 


M62831 


ETRlOl immediate early protein 






99 


10 


L01664 


CLC Chaiot-Leyden crystal protein 


56 


86.36 


22 


19 


V00493 


HBA2 hemoglobin, alpha 2 


57 


8636 


22 


19 


X00637 


HP haotoslobin 


58 


86.36 


22 


19 


W63676 


ESTs 


59 


85.71 


21 


18 


U01317 


HBD hemofflohin delta 


60 


85.71 


14 


12 

XAt 




r>lCF7P4'^4nK'F7P4^4ni9^ nrntMn 












ni9^ 


fit 


ft^ 71 
OD,/ X 




19 




TTT^I? 1 T ^ Al/k«vo-n«\ 1«L-A ^ /■m'cm-pA*.! 


62 


85 


20 


17 


7MR91 


AT A ^9 QtYii'ttnl^imliTiJif'A /IaI^'Q^ cinrfVia 
r\±^t\AJJi0 <X\l }\ limc V umialC^ tlCIul'*^ oyilUla. 












se 2 f sid^eroWa<?tic/hvnficliiY>Tni 

X« \ifX\A\^X\/\JXW>\^\^tXXyyj\J\^XXXKfXX»X 












c anemia) 


63 


85 


20 


17 


AA3 18275 


FTHl Ferritin, heavy polypeptide 1 


64 


84.62 


13 


11 


M16827 


AC ADM acyl-Coenzyme A dehydrogen 












ase» C-4 to C2 straight chain 


65 


83.33 


12 


10 


AA355657 


CTSG cathepsinG 


66 


81.82 


22 


18 


D14874 


ADM adrenomedidlin 


67 


81.82 


22 


18 


M69043 


NFKBIA nuclear factor of kappa light p 












olypeptide gene enhancer in 












BK^ells inhibitor, alpha 


68 


81.82 


22 


18 


X55656 


HBG2 hemoglobin, gamma G 


^0 
\jy 


O i.OZ 


99 


1 s 


X68277 


DUSPl dual specificity phosphatase 1 


/U 


01 fiO 

oi.oZ 




1 o 
lo 


W76477 


JUN v-jun avian sarcoma virus 17 o 












ncogene homolog 


71 


o i*o^ 


99 


15^ 


X56351 


ALASl aminolevulinate, delta-, syntfaa 

se 1 


79 


ftl k9 


1 1 

X i 


0 


AI168658 


FECH ferrochelatase (protoporphyri 


73 


81 82 


11 


Q 


AA256671 FU21939 hypothetical protein FU21939 












similar to 5-azacytidine indue 












ed gene 2 


74 


80 


15 


12 


AI819724 


COLlAl collagen, type I, alpha 1 


75 


78.95 


19 


15 


M96839 


PRTN3 protemase 3 (serine proteinase. 












neutrophil, Wegener granulo 












matosis autoantigen) 


76 


78.95 


19 


15 


R78436 


GATA2 GATA-binding protein 2 


77 


78.57 


14 


11 


AA449950 KIAA1016KIAA1016 protein 


78 


77.78 


18 


14 


AA456242 


FSP-2 fibrousheathinll 


79 


77.78 


18 


14 


AD57641 


CD'KN2C cyclin-dependent kinase inhibi 
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tor 2C (pl8, inhibits CDK4) 



80 


7778 


18 


14 


M13692 


ORMl 


orosomucoid 1 


Rl * 

Ol 


77 7ft 


1ft 


1A 


D90145 


SCYA3L1 small inducible cytokine A3-U 

L-zv 1 


CO 


/ /Jo 


1 Q 


1 A 

14 


AA903016 


HM74 


piitative chemokine receptor; . 














GTP-binding protein 


83 


77 27 


22 


17 


AI128538 L0C5 13 12 mitochondrial solute carrier 


R4 
ot 


7/x 07 




in 




HCRTR2 hypocretin(orexin) receptor 2 




76.47 


17 


13 


Lll J / 


CYP11B2 cytochrome P450, subfamily 














XIB (steroid 1 1-beta-hydroxyl 






17 








ase), polypeptide 2 


86 


76.47 


13 




ALS2CR2 Amyotrophic lateral sclerosis 














2 Ouvenile) chromosome regio 














n, candidate 2 


87 


76.47 


17 


13 






ESTs 


oo 


7/5 10 


71 


lO 




C8FW 


Phosphoprotein regulated by 














mitogenic pathways 


89 


75 


20 


15 


M33492 


TPSBl 


tryptase beta 1 


90 


75 


20 


IS 


AF043584 


IGL 


inmiunoglobulin lambda chain 


91 


75 


1/i 


19 


AI000650 




ESTs 




/3 


1 

lo 


12 


AW337343 


PTP4A1 


protein tyrosine phosphatase t 














ype IVA, member 1 




7'^ /5ft 


10 


Izt 


AA043835 


DAPPl 


dual adaptor of phosphotyrosi 














ne and 3-phosphoinositides 


94 


73 33 


1^ 


1 1 


AA449227 




EST 




77 7^ 




lo 


U21847 


TEG 


TGFB inducible early growth r 














e^nse 




79 7^^ 


1 1 
1 1 


o 
o 


U77942 


STX7 


syntaxin7 


07 


77 7*1 


1 1 
1 1 


o 
o 


AA345854 


ITGA3 


integrin, alpha 3 (antigen CD4 














9C, alpha 3 subunit of VLA-3 














receptor) 


98 


72.73 


11 


8 


AA3 14457 LOC56994cholinephosphotransferase 1 


99 


iim 


11 


8 


AI718618 


BIRC2 


baculoviral lAP repeat-contain 














ing2 


100 




11 


g 


AI276054 


FRAT2 


Frequently rearranged in adva 














need T-cell lymphomas 2 


101 


1232 


18 


13 


D86724 


ARG2 


arginase, type n 


102 


12.22 


18 


13 


X06233 


S100A9 


SI 00 calcium-binding protein 














A9 (calgranulin B) 


103 


72.22 


18 


13 


AA973757 


STX3A 


syntaxin3A 


104 


72.22 


18 


13 


AF068754 


HSBPl 


heat shock factor binding prote 
inl 


105 


71.43 


14 


10 


H97976 




ESTs 


106 


70.59 


17 


12 


D14657 


KIAAOlOl KIAAOlOl gene product 


107 


70 


20 


14 


AI056641 


FU22833 hypothetical protein FLJ22833 
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108 


70 


20 


14 


AA677931 


ESTs 


109 


69.23 


13 


9 


AB003476 


AKAP12 A kinase O^RKA'^ anchor orote 












in (gravin) 12 


110 


68.75 


16 


11 


AA854469 


RNF6 ring finger protein (C3H2C3 1 












ype)6 


111 


68.18 


22 


15 


AA327207 


ESTs 


112 


66.67 


21 


14 


AA772709 


Homo saoiens cDNA FLJ1352 












2fis clone PLACE1005884 


113 


66.67 


21 


14 


U46254 


ESTs 


114 


66.67 


21 


14 


AI087002 


ESTs 


115 


66.67 


18 


12 


AW237266 


ASAH N-acylsphingosine amidohydr 












olase (acid ceramidase) 


116 


66.67 


12 


8 


. H53099 


NDUFAIO NADH dehydrogenase (ubiqui 












none) 1 alpha subcomplex, 10 












(42kD) 


117 


66.67 


12 


8 


T03044 


EST 


118 


65 


20 


13 


W96110 


ZNF28 1 zinc finger protein 281 


119 


64.71 


17 


11 


AA629596 DKFZP564DKFZP564D177 protein 












D177 


120 


64.29 


14 


9 


M23161 


IHEl Human transposon-likeeleme 












ntmRNA 


121 


64^9 


14 


9 


D20186 


DKFZp762 hypothetical protein DKFZp76 












O076 2O076 


122 


6429 


14. 


0 

y 


H80325 


BAZIA bromodomain adjacent to zinc 












finger domain, lA 


123 


63.64 


22 


14 


U21847 


TIEG TGFB mducible early growth r 


124 










esponse 


63.64 


11 


7 


M31452 


C4BPA complement component 4-bin 












ding protein, alpha 


125 


63.64 


11 


7 


U39231 


GIPR gastric inhibitory polypeptide r 


126 










eceptor 


63.64 


11 


7 


AI262031 


ATPIOD ATPase, Class V, type lOD 


127 


63.64 


11 


7 


AF022385 


PDCDIO programmed cell death 10 


128 


63.64 


11 


7 


N58488 


EST 


129 


63.64 


11 


7 


AI016419 


ESTs 


130 


62.5 


16 


10 


H71303 


KIAA0481 BCIAA0481 gene product 


131 


62.5 


16 


10 


H06819 


FIJ10846 hypothetical protein FLJ10846 


132 


62.5 


16 


10 


W93000 


ESTs 


133 


62.5 


16 


10 


W37916 


HCF-2 host cell fector 2 


134 


61.9 


21 


13 


AF016833 


MGAM maltase-glucoamylase (alpha- 












glucosidase) 


135 


61.54 


13 


8 


X00948 


RLN2 relaxin2(H2) 


136 


61.54 


13 


8 


U55206 


GGH gamma-glutamyl hydrolase (c 



onjugase, folylpolygammaglut 
amyl hydrolase) 
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137 


61.54 


13 


8 


138 


61.54 


13 


8 


139 


60 


20 


12 


140 


60 


15 


9 


141 


60 


15 


9 


142 


59^09 


AA 

22 


13 


143 


Cf\ AA 

59.09 


22 


13 


144 


58.82 


17 


10 


145 


58.33 


12 


7 


146 


57.89 


19 


11 


147 


57.89 


19 


11 


148 


57.14 


21 


12 


149 


57.14 


21 


12 


150 


57.14 


14 


8 


151 


57.14 


14 


8 


152 


57.14 


14 


8 


1 CO 

153 


56.25 


16 


9 


154 


56.25 


16 


9 


155 


56.25 


16 


9 


156 


55 


20 


11 


157 


55 


20 


11 


158 


54.55 


22 


12 


159 


54.55 


22 


12 


160 


54.55 


11 


6 



8 AA308062 SIOOP 



AA731746 
AI078178 
M95809 



M93056 



LlOlOl 
W87690 

AK025906 



11 AA548765 



D49958 
AA778025 
AA047169 

L16464 
AF011468 
AI188389 

AA847136 CSF2RB 

AI025297 
M60974 
X56351 
U93869 



SlOO calcium-binding protein 
P 

ESTs 
ESTs 

GIKZHl genaal transcription factor II 
H, polypeptide 1 (62kD subuni 

t) 

ESTs 

SERPDSIBI serine (or cysteine) proteinase 
inhibitor, clade B (ovalbumin), 
member 1 
IER5 Immediate early response 5 
SRY sex determining region Y 

Homo sapiens cDNA: FLJ231 
73 fis, clone LNG10019 
Homo sapiens cDNA: FLJ222 
53 fis, clone HRC02763 

SMARCBl SWI/SNF related, matrix assoc 
iated, actin dependent regulato 
r of chromatin, subfamily b, m 
ember 1 
ESTs 

Homo sapiens cDNA FLJl 1 83 
8 fis, clone HEMBA1006624, 
weakly similar to DNA/PANT 
OTHENATE METABOLISM 
FLAVOPROTEIN HOMOLO 
G 

GPM6A glycoprotein M6A 
ESTs 

Homo sapiens cDNA: FU227 
56 fis, clone KAIA0791 
ETV3 ets variant gene 3 
STKl 5 serine/threonine kinase 1 5 
CI 10RF15 chromosome 1 1 open reading f 
rame 15 

Colony stimulating factor 2 re 
ceptor, beta, low-affinity (gran 
ulocyte-macrophage) 
KLF7 Kruppel-like factor 7 (ubiquito 
us) 

GADD45A growth arrest and DNA-damag 
e-inducible, alpha 
ALASl aminolevulinate, delta-, syntha 
se 1 

RPC39 polymerase (RNA)ni(DNAd 
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irected)(39kD) 



161 


54.55 


11 


6 


AA551628 FLJ10260 hypothetical protein FLJ10260 


162 


54.55 


11 


6 


AI268502 


ESTs 


163 


54.55 


11 


6 


AI768505 KIAA0707KIAA0707 protein 






1 1 


o 


W70293 


ESTs 


165 


53.85 


13 


7 


M28443 


AMY2A amylase, alpha 2 A; pancreatic 


166 


53.85 


13 


7 


T 77571 


DGS-A DiOeorce svndrome cene A 


167 


53.85 


13 


7 


A A 1 AlOOO 




168 


53.33 


15 


8 


X51699 


BGLAP bone gamma-carboxyglutamat 












e (gla) protein (osteocalcin) 


169 


53 33 


15 


g 


R86067 


ESTs, Weakly similar to KIA 












A1353 protein [H.sq>iens] 


170 


53 33 


15 

A*/ 


s 


U57961 


13CDNA7 putative gene product 
3 


171 

X # J. 


53 33 


15 


8 


AI025822 


EST 


1 /Z 






Q 
O 


AI150469 


ESTs 


173 


CO C%A 


1 n 
If 


A 


M23204 


OAT ornithine aminotransferase (gy 












rate atrophy) 


174 


52 63 


19 


10 


L04270 


LTBR lymphotoxin beta receptor (T 












NFR superfamily, member 3 


17^ 




19 


10 


AW511361 SLC29A1 solute earner family 29 (nucle 












oside transporters), member 1 


176 


52.63 


19 


10 


AA027229 


ESTs, Weakly similar to F45 












E12.5 [C.elegans] 


177 


52.63 


19 


10 


AI201982 


Homo sapiens cDNA FLJl 151 












6 fis, clone HEMBA1002328 


178 


52.38 


21 


11 


AA706319 


ESTs 


179 


52.38 


21 


11 


W05688 


ESTs 


180 


50 


20 


10 


X56741 


MEL mel transforming oncogene (d 












erived from cell line NK14)- R 












AB8 homolog 


181 


50 


18 


9 


AI032402 


ESTs 


182 


50 


16 


8 


F10728 


ESTs 


183 


50 


16 


8 


AA576399 


ESTs 


184 


50 


16 


8 


T36260 


SEC23B Sec23 (S. cerevisiae) homolog 
B 


185 


50 


16 


8 


AJ250075 


CHI membrane protein CHI 


186 


50 


12 


6 


All 87066 


ESTs 


187 


50 


12 


6 


AA586974 


PI3 protease inhibitor 3, skin-deriv 












ed (SKALP) 


188 


50 


14 


7 


AA639599 SLC12A2 solute carrier family 12 (sodiu 












m/potassium/chloride transpor 












ters), member 2 


189 


50 


14 


7 


AA807551 


ESTs 


190 


50 


14 


7 


AA503224 


ESTs 
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Accession numbers and gene symbols were retrieved from the Unigene Databases (buil<^13 1). 



TABLE4 DOWN-REGULATED GENES 



CML ratio(%) infonnad ratio<5 Accession 



dgmnent 


1) 


ve cases 


2) 


No. 


1Q1 


1 \J\J 


16 




AFO'^6268 


192 


94.74 


19 


18 


AF069736 


193 


93.75 


16 


15 


AB006179 


194 


93.33 


15 


14 


AA806043 


195 


90.91 


11 


10 


D89617 


196 


90.48 


21 


19 


M61900 


197 


88.24 


17 


15 


M73780 


198 


88.24 


17 


15 


M60450 


199 


87.5 


16 


14 


U28369 


200 


86.67 


15 


13 


M13149 


201 


86.67 


15 


13 


AI089023 


202 


86.36 


22 


19 


D00749 


203 


85 


20 


17 


M88468 


204 


84.62 


13 


11 


AF040723 


205 


84.21 


19 


16 


M24405 


206 


82.35 


17 


14 


M81886 . 


207 


82.35 


17 


14 


M25809 


208 


81.82 


22 


18 


R72651 


209 


81.82 


22 


18 


X97229 


210 


81.82 


22 


18 


D87465 



Symbol 



Gene name 



SH3GL2 
PAF65B 
HS6ST 

IGHG3 

HSPB2 
PTGDS 
ITGB8 
KCNA4 

SEMA3 
B 

HRG 
FXYD7 

CD7 
MVK 

HAPl 

TCF3 



GRIAl 
ATP6B1 



KIR2DL 
4 

KIAA02 
75 



SH3-domain GRB24ike 2 
PCAF associated factor 65 beta 
heparan sulfate 6-0-sulfotransfera 
se 

immunoglobulin heavy constant g 
amma 3 (G3m marker) 
heat shock 27kD protein 2 
prostaglandin D synthase gene 
integrin, beta 8 

potassium voltage-gated channel, s 
haker-related subfamily, member 4 
sema domain, immunoglobulin do 
main (Ig), short basic domain, seer 
eted, (semaphorin) 3B 
histidine-rich glycoprotein 
FXYD domain-containing ion tran 
sport regulator 7 
CD7 antigen 

mevalonate kinase (mevalonic aci 
duria) 

huntingtin-associated protem 1 (ne 
uroan 1) 

transcription fector 3 (E2 A immun 
oglobulin enhancer binding factors 
E12/E47) 

glutamate receptor, ionotropic, A 
MPAl 

ATPase, H+ transporting, lysosom 
al (vacuolar proton pump), beta po 
lypeptide, 56/58kD, isoform 1 
ESTs, Weakly similar to PLK^H 
UMAN PROTEOGLYCAN LINK 
PROTEIN PRECURSOR |H.sapi 
ens] 

killer cell immunoglobulin-like rec 
eptor, two domains, long cytoplas 
mic tail, 4 

KIAA0275 gene product 
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211 


81.82 


11 


9 


S77410 


212 


81.82 


11 


9 


U52112 


213 


. 81.25 


16 


13 


U18468 


214 


80 


20 


16 


D29990 


215 


80 


15 


12 


M23323 


216 


78.95 


19 


15 


M30607 


217 


78.95 


19 


15 


M98833 


218 


77.27 


22 


17 


X67292 


219 


76.92 


13 


10 


AA543086 


220 


75 


20 


15 


L13258 


221 


75 


12 


9 


D00174 


222 


73.68 


19 


14 


U05227 


223 


73.33 


15 


11 


U79240 


224 


73.33 


15 


11 


X07994 


225 


72.73 


22 


16 


M80462 


226 


71.43 


21 


15 


J04132 


227 


70.59 


17 


12 


T Tf\n ^ AC 

U97145 


228 


69.23 


13 


9 


U05321 


229 


69.23 


13 


9 


M24405 


230 


69.23 


13 


9 


U02619 


231 


68.75 


16 


11 


J04599 


232 


68.42 


19 


13 


X80818 



AGTRl angiotensin receptor 1 
ARDl N-acetyltramferase,homologofS. 

cerevisiaeARDl 
PSG4 pregnancy specific beta-glycoprote 
in4 

SLC7A2 solute carrier family 7 (cationic a 
mino acid transporter, y+ system), 

member 2 

CD3E CD3E antigen, epsilon polypeptide 



ZFY 
FLU 

IGHM 



SLC34A 
1 

SERPIN 
F2 



SEC4L 



LCT 
CD79A 

CD3Z 

GFRA2 
SLCI6A 
2 

TCF3 



GTF3C1 



BGN 
GRM4 



zinc finger protein, Y-linked 
Friend leukemia virus integration 
1 

immunoglobulin heavy constant m 
u 

Homo sapiens cDNA: FLJ23270 fi 
s, clone COL10309, highly similar 
to HSU33271 Human nomial kera 
tinocyte mRN A 

solute carrier family 34 (sodium p 
hosphate), member 1 
serine (or cysteinb) proteinase inhi 
bitor, clade F (alpha-2 antiplasmin, 
pigment epithelium derived facto 
r), member 2 

GTP-binding protein homologous 
to Saccharomyces cerevisiae SEC 
4 

Human PAS-serine/threonine kina 

se mRNA, partial cds 

lactase 

CD79A antigen (immunoglobulin- 
associated alpha) 

CD3Z antigen, zeta polypeptide (T 
iT3 complex) 

GDNF family receptor alpha 2 
solute carrier family 16 (monocarb 
oxylic acid transporters), member 
2 Q)utative transporter) 
transcription factor 3 (E2 A immun 
oglobulin enhancer binding factors 
E12/E47) 

general transcription factor HIC, p 
olypeptide 1 (alpha subunit, 220k 
D) 

biglycan 

glutamate receptor, metabotropic 4 
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233 


68.42 


19 


13 


X70991 


NAB2 NGFI-A binding protein 2 (ERGl 












binding protein 2) 


234 


68.42 


19 


13 


J00269 


KRT6A keratin6A 


235 


68.18 


22 


15 


AA778161 


RPL26 ribosomal protein L26 


236 


66.67 


18 


12 


AI123516 


ESTs 


237 


66.67 


15 


10 


U34623 


CD6 CD6 antigen 




00.0/ 


12 


0 


AA421322 


IGL@ immunoglobulin lambda locus 


239 


66.67 


12 


8 


M91196 


ICSBPl raterferon consensus sequence bin 












ding protein 1 




00.0/ 


12 


Q 


M12807 


CD4 CD4 antigen (p55) 


241 


66.67 


12 


8 


D52745 


KIAA08 lectomedin-2 
21 


242 


64.29 


14 


A 

9 


S82807 


TSHR thyroid stimulating hormone recep 


24^ 


04.zy 


1 A 
14 


0 


AI027554 DKFZP5DKFZP586J1624 protein 












86J1624 


Z4rr 


Oj.04 


ZZ 


JL^f 


U52682 


IRF4 interferon regulatory factor 4 


Z4D 


0j.04 


00 
ZZ 


1 A 
14 


X03066 


HLA-DO major histocompatibUily complex. 












B class n, DO beta 


Z*fO 


0^.04 


00 

zz 


14 


M15800 


MAL mal» T-cell differentiation protein 


OAT 
24/ 


0J.04 


00 

2Z 


1 /I 
14 


AB66242 


ESTs 


248 


63.64 


22 


14 


X54101 


GNLY gjranulysin 


249 


63.64 


22 


14 


T04932 


Homo sapiens cDNA: FLJ21545 fi 












s, clone COL06i95 


OCA 


63.64 


22 


1 A 

14 


AI248183 


PAX5 Paired box gene 5 (B-cell lineage s 












pecific activator protein) 






1 1 
1 i 


7 


AF064804 SUPT3H suppressor of Ty (S.cerevisiae) 3 h 












omolog 


252 


63.64 


11 


7 


D28769 


PBX2 pre-B-cell leukemia transcription f 












actor 2 


253 


63.64 


11 
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Accession numbers and gene symbols were retrieved £ix)m the Unigene Databases (bmld#13 1). 



Confirmation bv semi-quantitative RT-PCR 

To confirm the reliability of the expression differences indicated by microarmy analysis, 
semi-quantitative RT-PCR experiments were performed for the 1 1 highly up-regulated genes in 

5 aUofthe informative samples (RNASE3,CTSG,MMP9, HP, 

KIAA1254, and two ESTs (Accession No. H23213 and H48537). A 3-ng aUquot of aRNA fix>m 
each sample was reverse-transcribed for single-stranded cDNAs using random primer (Roche) 
and Superscript n (Life Technologies, Inc.)- Each cDNA mixture was diluted for subsequent 
PCR amplification with the same primer sets that were prepared for the target DNA- or p-actin- 

10 specific reactions. The primer sequences are listed in Table 2. Expression of p-actin served as 
an intemal control. PCR reactions were optimized for the number of cycles to ensure product 
intensity within the linear phase of amplification. The RT-PCR results were highly concordant 
to those of the microarray analysis in the great majority of the tested cases i^see Fig. 1). These 
data verified the reliability of our strategy to identify commonly up-regulated genes in CML 

15 cells. 



Table 2 Primer sequences for semi-quantitative RT-PCR experiments 

?^ Accession i. i ^ • SEQ.ID. ^ . SEQ.I 

Assign Symbol Forward prrnier ^ Reverse prmaer 

ment * ' 

5*-GTTCCAAAA 5*-GGTATGGAGA 

1 X16545 KNASE3CTGTTCACTTCC No.l CTGATGAGGACA No.2 

C-3* G-3' 

5'-CTTCTGCTGG 5'-TGTGGACGTT 

5 M16117 CTSG CCTTTCTCCTA No.3 TATTAAGGCTCT No.4 

C-3* G-3' 
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7 J05070 
K03431 

g 

15 H23213 
Z98744 

16 



MMP9 
HPR 

EST 
H3FJ 



17 M24173 HBZ 



18 X02419 PLAU 



19 H48537 
20 



EST 

KIAA12 
54 



57 K01763 HP 
V00478 ACTB 



5'-GAACCAGCT 
GTATTTGTTCA No.5 

AGG-3' 
5'-TCCTGAATGT 
GAAGCAGTATG No.7 

TG-3' 
5'-GTCCCAAGA 
TGCATATTTTCC No.9 
T-3' 

5'-ACTTTCTGAC 
TTAGGCCACAG No.ll 

GT-3' 
5'-TCTCTGACCA 
AGACTGAGAGG No.l3 

AC-3' 
5'-CAGTCACAC 
CAAGGAAGAG No.l5 

AATG-3' 
5'-GTGTGATTAT 
CAAAAGGGAGT No.l7 

GG-3' 
5'-TCCTACTTTG 
GCCAAGTTTGT No.l9 
T-3' 

5'-AAGGAGATG 
GAGTGTACACC No.21 

TTAAA-3' 
5'-CATCCACGA 
AACTACCTTCA No.23 
ACT-3' 



5'-AAAACAAAGG 
TGAGAAGAGAG No.6 

GG-3' 
5'-AGCCTTGCAT 
TAGTTCTCAGCT No.8 

A-3' 

5'-CCGAGCCCAT 
TAATACTGATAG No.lO 
A-3' 

5'-ACAGAGTGCT 
CAGTTCTTCCGT No.l2 
A-3' 

5'-GAGGATACGA 
CCGATAGGAACT No.l4 
T-3' 

5'-CAGTGAGGAT 
TGGATGAACTAG No.16 
G-3' 

5'-AATAGTGCCT 
ATTTAAGGCCG- No.18 
3' 

5'-ACTAAGCTGG 
TACATGGAATGG No.20 

A-3' 

5'-TGATTGACTC 
AGCAATGCAGG- No.22 
3' 

5'-TCTCCTTAGA 
GAGAAGTGGGG No^4 
TG-3' 



Indiistrial Applicability 

The ^ne-expiession analysis of CML described herein, obtained through a combination 
of laser-capture dissection and genome-wide cDNA microarray, has idoitified specific genes as 
s targets for cancer prevention and therapy. Based on the expression of a subset of these 

differentially expressed genes, the present invention provides a molecular diagnostic markers for 
identifying or detecting CML. 

The methods described herein are also useful in the identification of additional molecular 
targets for prevention, diagnosis and treatment of CML. The data reported hereia add to a 
10 comprehensive understanding of CML, facilitate development of novel diagnostic strategies, and 
provide clues for identification of molecular targets for therapeutic drugs and preventative agents. 
Such information contributes to a more profound understandii^ of tumorigenesis of CML, and 
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' provide indicators for developing novel strategies for diagnosis, treatment, and ultimately 
prevention of CML. 

All patents, patent applications, and publications cited herein are incorporated by 
reference in their entirety. Furthermore, while the invention has been described in detail and 
5 with reference to specific embodiments thereof, it will be ^parent to one skilled in the art that 
various diai^es and modifications can be made therein vdthout d^)axting fiom the spirit and 
scope of the invention. 
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